423 Identification and Mutational Analysis of Major Epitopes of the
Shrimp Allergen Pen a 1 (Tropomyosin) R Ayuso, G Reese, SM
Leong-Kee, MJ Plante, SB Lehrer Tulane University Medical Center,
New Orleans, LA, USA

The major allergen of shrimp, identified as the muscle protein 
tropomyosin, belongs to a family of highly conserved proteins with 
multiple isoforms found in both muscle and non-muscle cells of all 
species of vertebrates and invertebrates. Allergenic tropomyosins are
found in invertebrates such as crustaceans (shrimp, lobster, crab,
crawfish), arachnids (house dust mites), insects (cockroaches), and
mollusks (e.g. squid), whereas vertebrate tropomyosins are considered
to be non-allergenic. Five major IgE-binding regions of Pen a 1
(region 1: 43-57, region 2: 85-105, region 3: 133-153, region 4: 187-201,
and region 5: 247-284) were identified using synthetic, overlapping
peptides (15 amino acids long, offset of 6 residues) and sera of 18
shrimp-allergic subjects. The minimal epitopes (shortest peptides
with maximal IgE antibody reactivity) of three (regions 1, 3, and 5) of
the five major IgE-binding regions were characterized. Each subject
showed an individual reactivity pattern; not all epitopes were detected
by all subjects. In region 1, one IgE-binding epitope, epitope 1 (43-55),
was recognized with maximal IgE antibody reactivity by all four
region 1-reactive subjects. Within region 3, two epitopes were identified,
centered around the cores 3a (137-141) and 3b (144-151), with
different lengths (8-15 amino acids) recognized in different subjects
(n = 7). In region 5, three epitopes were identified: 5a, 5b, and 5c.

Epitope 5a is centered around a common core (251-259), with a 
little variability in the length of the epitopes recognized by individual 
subjects. Epitope 5b (266-273) is identical for all 4 region 5-reactive 
subjects and epitope 5c (273-281) was recognized only by one sub- 
ject. The effects of amino acid substitutions on the IgE binding capac- 
ity of identified epitopes were tested by comparing IgE antibody reac- 
tivities of shrimp-allergic subjects to identified Pen a 1 epitopes with 
those of homologous sequences of various non-allergenic vertebrate
tropomyosins. In addition, all possible substitution combinations that 
resulted from the differences between Pen a 1 epitopes and the 
homologous vertebrate sequences of up to five substitutions per pep- 
tide were tested. One or two substitutions may eliminate, reduce, not 
alter or even increase the IgE antibody reactivity whereas more than 
two substitutions abolished the IgE binding ability of modified Pen a 
1 peptides. In general, substitutions in the center of the epitope 
sequence are more likely to eliminate (59.5%) IgE binding as com- 
pared to substitutions in the peripheral parts of the epitopes (39.1%). 
Most of the single substitutions that eliminate IgE binding are
non-conservative (63.5%) as compared to 23.1% of conservative substitutions.
Thus, single amino acid non-conservative substitutions in a
core part of an epitope are most likely to significantly reduce or abol- 
ish IgE antibody binding and may be useful in future development of 
safer allergen vaccines or less allergenic foods. 
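
The substitution screen described above (every combination of the residues that differ between an epitope and its vertebrate homolog, up to five substitutions per peptide) can be sketched as a simple enumeration. This is only an illustration under stated assumptions: the function name `substitution_variants` and both example sequences are hypothetical and are not taken from the abstract, and the two sequences are assumed to be pre-aligned and of equal length.

```python
from itertools import combinations


def substitution_variants(epitope, homolog, max_subs=5):
    """Yield (positions, variant) for each combination of homolog-derived substitutions.

    Assumes `epitope` and `homolog` are already aligned and of equal length.
    """
    if len(epitope) != len(homolog):
        raise ValueError("sequences must be aligned and of equal length")
    # Positions at which the two sequences differ.
    diffs = [i for i, (a, b) in enumerate(zip(epitope, homolog)) if a != b]
    for k in range(1, min(max_subs, len(diffs)) + 1):
        for positions in combinations(diffs, k):
            residues = list(epitope)
            for i in positions:
                residues[i] = homolog[i]  # swap in the homolog residue
            yield positions, "".join(residues)


# Hypothetical sequences for illustration only (not Pen a 1 data).
epitope = "AKLEEAEKR"
homolog = "AKLDEAQKR"
variants = list(substitution_variants(epitope, homolog))
for positions, peptide in variants:
    print(positions, peptide)
```

With two differing positions the sketch yields three peptides: two single substitutions and the double substitution, which is identical to the homologous sequence.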

424 Cross-Reactivity of Walnut and Other Tree Nuts by IgE 
Immunoblot Inhibition SS Teuber*†, SK Sathe‡, KH Roux‡, WR
Peterson* *University of California, Davis, School of Medicine,
Davis, CA †Veterans Affairs Northern California Healthcare System,
Mather, CA ‡Florida State University, Tallahassee, FL

Clinically, patients with walnut food allergy often report hyper- 
sensitivity to more than one nut. In our experience, this often includes 
pecan, which is in the same botanical family as walnut. In some 
patients with life-threatening reactions to walnuts, the clinical reac- 
tion to other nuts is much less severe (localized itching of the mouth, 
for instance), but others report reactions of the same severity. We 
wished to investigate the cross-reactivity between tree nuts by IgE 
immunoblot inhibition in order to see if there were generalizations 
that could be made about particular tree nuts and the potential to 
cross-react with walnut. Sera from patients with life-threatening wal- 
nut food allergy were used in Western IgE immunoblotting against 
walnut. The sera were diluted so that IgE binding to walnut proteins
was visible, but not intense, maximizing the potential for other allergen
sources to absorb out specific IgE. Sera were then incubated overnight 
with increasing concentrations of walnut extract (the homologous posi- 
tive control), ovalbumin (negative control), almond, pecan, cashew and 
hazelnut extracts. Sera were then used in standard IgE immunoblotting 
to walnut protein extract separated by SDS-PAGE under reducing conditions
(albumin/globulin fraction). The immunoblots showed no inhibition
of specific IgE by the negative control, ovalbumin. Significantly
however, pecan extract was able to absorb out IgE to walnut on a basis 
equivalent to that of walnut itself, at the lowest concentration of 1 
mcg/mL. For each of the other nuts though, much variability in the 
capacity of other nut proteins to absorb out walnut-specific IgE was 
seen. In summary, pecan appears to be totally cross-reactive with walnut
at even the lowest concentrations used, and this correlated with the
clinical reports of cross-reactivity in these patients. 

425 Peanut Protein Digestibility: A Gastric and Intestinal Model CL
Sellers*†, SS Teuber‡, BB Buchanan*, I Chen *University of California
at Berkeley, Department of Plant and Microbial Biology,
Berkeley, CA †Cerro Coso College, Ridgecrest, CA ‡University of
California, Davis, School of Medicine

Peanut allergy is typically life-long and can be a cause of fatal ana- 
phylaxis. The purpose of this study was to determine the digestibility of 
peanut allergens. Crude protein extracts of peanut were exposed to
pepsin in simulated gastric fluid (SGF). The digests were subjected to 
SDS-PAGE and IgE immunoblotting using sera from patients with life- 
threatening reactions to peanuts. Allergenic proteins of the crude extract 
which were not digested by pepsin in SGF were isolated by gel filtration 
chromatography. Pepsin-resistant allergens were reduced with dithiothreitol
(DTT) and exposed to pepsin in SGF. Pepsin-resistant isolates,
in the oxidized state, were also exposed to pancreatin in simulated 
intestinal fluid (SIF). Aliquots were taken and digestion was stopped at 
0, 0.25, 1, 2, 4, 15, 30, and 60 minutes. Reduced proteins exposed to
pepsin were subjected to SDS-PAGE. Proteins exposed to pancreatin
were subjected to SDS-PAGE, then transferred to nitrocellulose for IgE
immunoblotting. Results: Three low molecular weight proteins, 14, 17 
and 18 kDa were the only proteins in the crude extracts resistant to 
pepsin digestion. N-terminal sequencing revealed the 18 and 17 kDa 
proteins to be the peanut 2S protein, Ara h 2. The 14 kDa band was found
to consist of multiple proteins. Ara h 2 was highly resistant to pepsin in 
its oxidized form, remaining intact in the gel after 30 min of exposure. 
The 14 kDa band displayed even greater resistance, remaining in the gel 
after 60 min of exposure. Reduction of Ara h 2 and the 14 kDa proteins
rendered the proteins digestible by pepsin after 1 min of exposure. Ara h 
2 and the 14 kDa proteins were digested by pancreatin in SIF as assessed 
by IgE immunoblotting by 2 min and 15 min, respectively. In summary, 
Ara h 2 and the 14 kDa fraction were the peanut allergens most resistant 
to pepsin digestion, but were digested in SIF, though not rapidly.
Interestingly though, Ara h 1, a 63 kDa major allergen, was digested after a
mere 15 seconds of exposure to pepsin in SGF. 

426 Identification of Tropomyosin as Mollusk Allergens in Abalone,
Scallop and Mussel PSC Leung*, SH Wong†, KH Chu† *Division
of Rheumatology, Allergy and Clinical Immunology, School of
Medicine, University of California, Davis, CA, USA †Department
of Biology, The Chinese University of Hong Kong, Shatin, Hong
Kong, China

Shellfish is one of the most common food allergens. Recently, 
tropomyosin was identified as the major heat stable crustacean aller- 
gen in shrimp, lobster and crab. In addition to crustaceans, many 
species of mollusks are commonly consumed all over the world. 
Interestingly, a significant portion of shellfish allergic populations is 
also hypersensitive to mollusks. Several mollusk allergens including 
Tod p 1 in squid, Hal m 1 in abalone and Cra g 1 in oyster have been
characterized using biochemical techniques. The objective of the 
present study is to determine whether tropomyosin is a candidate 





Abstracts

J ALLERGY CLIN IMMUNOL

JANUARY 2000



554 Structure Plays a Critical Role in the Allergenicity of Food Proteins
GA Bannon*†, Kopper*†, SM Maleki, DS Shin*†, HA
Sampson†§, AW Burks†‡ *Department of Biochemistry and Molecular
Biology †University of Arkansas for Medical Sciences
‡Department of Pediatrics §Mt Sinai School of Medicine

In the past decade there has been an increase in allergic reactions 
to peanut proteins, sometimes resulting in fatal anaphylaxis. The 
development of treatment strategies for peanut allergies requires a 
better understanding of the immune response to these proteins and 
a better understanding of the structural aspects of the allergens. One 
of the classic characteristics of food allergens is that they are resistant
to degradation during processing and digestion. In order to discover
the reasons food allergens can resist degradation, the structures
of the major peanut allergens, Ara h 1 and Ara h 2, were studied in
relation to their IgE-binding epitopes. Ara h 1 is a protein belong- 
ing to the vicilin family of seed storage proteins while Ara h 2 is a 
conglutin-like seed storage protein. Both allergens are recognized 
by serum IgE from > 90% of peanut-sensitive patients. Ara h 1 was 
shown to form a highly stable homo-trimer that was stabilized by 
hydrophobic interactions. A molecular model of the Ara h 1 trimer 
was constructed to view the stabilizing hydrophobic residues in the 
three dimensional structure. Hydrophobic amino acids that con- 
tribute to trimer formation are at the distal ends of the 3-D structure 
where monomer-monomer contacts occur. Coincidentally, the 
majority of the IgE-binding epitopes are also located in this region 
suggesting that they may be protected from digestion by the 
monomer-monomer contacts. Upon incubation of Ara h 1 with 
digestive enzymes, various protease resistant fragments containing 
multiple IgE-binding sites were identified. These peptide fragments 
were protected from digestion for up to 3 hrs. On the other hand, 
Ara h 2 does not appear to form any stable higher order oligomeric 
structures. However, numerous intra-molecular disulfide bridges 
stabilize its tertiary structure. Just as observed with the Ara h 1 pro- 
tein, the native form of the Ara h 2 allergen is also highly resistant 
to digestion with enzymes commonly found in the human gastroin- 
testinal tract. When the Ara h 2 protein is reduced, thus breaking the 
disulfide bonds and destroying the tertiary structure of the protein, 
it becomes highly susceptible to digestion. In addition, when the 
Ara h 2 disulfide bonds are allowed to re-form in a random manner, 
the allergen is digested at the same rate as the reduced form of this 
protein. Collectively, these results show that the tertiary and quater- 
nary structure of these two food allergens contribute significantly to 
their resistance to digestion, a classic characteristic of a protein des- 
tined to be a food allergen. 

555 Identification and Characterization of IgE Binding Epitopes of
Patatin, a Major Food Allergen of Potato James Astwood*, Murtaza
Alibhai*, Thomas Lee*, Roy Fuchs*, Hugh Sampson† *Monsanto
Company †Mount Sinai Hospital

Patatin has recently been identified as the 42 kDa major food 
allergen of potato (Sol t 1). Although potato allergy is rare, we have 
identified five potato allergic patients with positive skin prick tests 
and positive DBPC food challenges. Symptoms ranged from atopic 
dermatitis to food anaphylaxis in these patients, many of whom 
have been hospitalized numerous times after eating potato. Serum 
from these patients selectively identified and verified patatin as a 
major allergen from potato tubers by immunoblotting patatin pro- 
tein purified from plants or total protein from potato tuber extracts. 
Sera from all patients recognized the 42 kDa patatin protein. The genes for
patatins, a large gene family composed of numerous isoforms
which represent ~40% of potato tuber protein, were isolated
many years ago. The availability of DNA and protein sequences
allowed us to utilize positive IgE binding from each patient to identify
the IgE binding epitopes of patatin. By synthesizing 89 overlapping
decamer oligopeptides representing the full patatin primary
amino acid sequence, we have identified the major and minor IgE
binding epitopes of this allergen. Subsequently, we have identified
within these epitopes, substitutions in amino acid residues that 
appear to reduce or abolish IgE binding to these peptides. By iden- 
tifying amino acid sequences that potentially reduce IgE binding, it 
may be possible to engineer patatins with reduced allergenicity for 
use in immunotherapy and other applications. 

556 Macadamia Nut Anaphylaxis: Demonstration of Specific IgE 
Reactivity and Partial Cross Reactivity With Hazelnut Michael 
Sutherland, Robyn O'Hehir, Daniel Czarny, Cenk Suphioglu
Department of Allergy, Asthma and Clinical Immunology, Monash 
University Medical School, The Alfred Hospital, Prahran, Victoria, 
Australia 

BACKGROUND: Anaphylaxis due to nut allergy is a serious 
and potentially fatal condition. There have been no reported cases 
of anaphylaxis due to macadamia, an Australian tree nut originating 
from Macadamia integrifolia, Macadamia tetraphylla and their
hybrids. We report a case of anaphylaxis due to macadamia nut 
ingestion and confirmation of the causative allergen by 
immunoblotting. 

AIM: To confirm macadamia nut as the causative allergen and 
characterise its protein and allergenic profile. 

CLINICAL SUMMARY: An 18 year old female with no history
of nut allergy experienced anaphylaxis after eating a flourless
orange cake made with macadamia meal. On review one month 
later, skin tests to raw macadamia nut were positive (30 mm wheal). 

METHODS: Extracts of raw and roasted macadamia nut, 
peanut and hazelnut were resolved on SDS-PAGE and proteins 
stained with Coomassie Brilliant Blue (CBB). Nut extracts were
also electroblotted to nitrocellulose membranes and incubated with 
the patient and non allergic control sera before detection with rab- 
bit anti-human IgE and peroxidase labelled mouse anti-rabbit IgG. 
Inhibition immunoblots were performed against macadamia extract 
with pre-incubation of the allergic patient sera (1:60 dilution) with 
25 µg, 50 µg and 100 µg of macadamia, peanut and hazelnut
extracts and 25 µg of macadamia oil extract.

RESULTS: CBB staining showed macadamia to have a differ- 
ent protein profile to the other nut allergens. The immunoblot 
experiments showed strong IgE binding of the patient (but not the 
control) serum to a 17.4 kDa protein in the macadamia extract (raw 
and roasted), with no significant binding to peanut and some bind- 
ing to hazelnut. The inhibition immunoblots showed IgE binding 
was abolished by pre-incubation with macadamia extract, unaffected
by pre-incubation with peanut and weakly inhibited by 100 µg
of hazelnut and 25 µg of macadamia oil extract.

CONCLUSIONS: Macadamia nut can be a cause of anaphylaxis.
A 17.4 kDa protein present in both raw and roasted
macadamia nut extracts is the principal allergen in this patient. 
Hazelnut and macadamia nut oil exhibit some IgE binding capacity 
and should also be avoided by macadamia allergic patients. 

Financial support from the National Health and Medical 
Research Council, the Australian Allergy Foundation and The 
Alfred Research Trusts is acknowledged. 

557 IgE Antibodies to Patatin of Potato Tuber (Sol t 1) in Atopic
Infants Suspected for Potato Allergy Heli Majamaa*, Ulla Seppälä†‡,
Kristiina Turjanmaa*, Timo Palosuo‡, N Kalkkinen†, T
Reunala* *Department of Dermatology, University Hospital of Tampere,
Finland †Institute of Biotechnology, University of Helsinki,
Finland ‡National Public Health Institute, Helsinki, Finland

Clinical manifestations of food allergy are most commonly
observed within the first three years of life. Allergy to potatoes is considered
fairly uncommon although potatoes are widely consumed as a
staple and one of the first solid foods encountered by infants. When 
Linear IgE epitope mapping of the
English walnut (Juglans regia) major
food allergen, Jug r 1

Jason M. Robotham, BS,ᵃ Suzanne S. Teuber, MD,ᵇ Shridhar K. Sathe, PhD,ᶜ and
Kenneth H. Roux, PhDᵃ Tallahassee, Fla, and Davis, Calif



Background: Peanut and tree nut allergies can be life-threatening,
and they appear to be growing in prevalence. Jug r 1, a
2S albumin seed storage protein, was previously characterized
as a major English walnut food allergen.
Objective: We sought to identify the linear IgE-binding epitopes
of Jug r 1 and to determine which, if any, amino acids
are necessary for this binding to occur.

Methods: Pools of sera from walnut-allergic patients and overlapping
peptides synthesized on an activated cellulose membrane
were used to screen for IgE-binding epitopes. Mutational
analysis of the immunodominant epitope was carried out
through single and multisite amino acid substitutions. Inhibition
assays were performed through use of affinity-purified
IgE, soluble forms of the epitope peptide, and the recombinant
2S albumin, rJug r 1.

Results: One immunodominant linear epitope was identified.
Amino acid mutations to the epitope demonstrated that the
residues RGEE, at positions 36 through 39, were minimally
required for IgE binding. Probing of this epitope with sera
from each of 20 patients revealed 15 of the sera to be positive.
Binding of patients' IgE to the epitope was inhibited with a
soluble form of the peptide; however, soluble peptide did not
completely inhibit the binding of IgE to the intact rJug r 1.
Conclusion: One major linear IgE-reactive epitope and its critical
core amino acid residues have been identified. Mutation of
any of these core amino acids resulted in loss of IgE binding to
the epitope, and this points toward the feasibility of reducing
allergenicity in genetically modified walnuts. However, strong
evidence for the existence of conformational epitopes was also
obtained. (J Allergy Clin Immunol 2002;109:143-9.)

Key words: Epitope mapping, 2S albumin, walnut, nut allergy, Jug
r 1, food allergy



From ᵃthe Department of Biological Science and Structural Biology Program,
Florida State University, Tallahassee; ᵇthe Department of Internal Medicine,
Division of Rheumatology, Allergy and Clinical Immunology, University
of California, Davis, School of Medicine; and ᶜthe Department of
Nutrition, Food, and Exercise Science, Florida State University, Tallahassee.

Funded in part by the International Life Sciences Institute, the Allergy and 
Immunology Institute (Suzanne S. Teuber), USDA NRICGP grant
(#9901530), and a training grant fellowship from Florida State University 
NSF-RTG (#9602233). 

Received for publication July 26, 2001; revised September 20, 2001; accepted
for publication September 25, 2001.

Reprint requests: Kenneth H. Roux, PhD, Department of Biological Science, 
Biology Unit I, Florida State University, Tallahassee, FL 32306-4370. 

Copyright © 2002 by Mosby, Inc

0091-6749/2002 $35.00 + 0 1/87/120558

doi:10.1067/mai.2002.120558



Abbreviations used 

DMF: N,N-dimethylformamide
Fmoc: 9-fluorenylmethoxycarbonyl-derived
NC: Nitrocellulose
RT: Room temperature
TBS: Tris-buffered saline [solution]
TBS-T: Tris-buffered saline [solution] with 0.2% Tween-20



It is estimated that up to 8% of children less than 3
years of age and 2% of adults are affected by food allergies.
Although food-induced allergic reactions are the
most common cause of outpatient anaphylaxis, most
severe reactions of this kind are caused by peanuts and
tree nuts. Most plant food allergens can be found
among pathogenesis-related proteins, seed storage albumins
and globulins, and α-amylase and protease
inhibitors. A previous study by Teuber et al documented
the cloning and sequencing of a gene encoding Jug r
1, a major allergen in the English walnut, Juglans regia.
Jug r 1 is a 2S albumin seed storage protein and possesses
important homologies in amino acid sequence with
other 2S albumin proteins from Brazil nut, cottonseed,
castor bean, and mustard. Like many 2S albumins, Jug r
1 is synthesized as a precursor protein and cleaved into a
large subunit and a small subunit, these being joined by
disulfide bridges. It is currently believed that patients
with life-threatening allergies to walnuts and other tree
nuts will rarely become tolerant of these foods and consequently
face a lifetime of avoidance. Avoidance, however,
is often difficult because of the ubiquitous nature of
these foods and the potential for cross-contamination
during their processing.

Many approaches to allergen immunotherapy under
investigation today are based on a detailed knowledge of
the amino acids found in IgE-reactive B-cell epitopes.
These reactive sites can be either linear or conformational.
Typically, a linear epitope contains a stretch of contiguous
amino acids spanning 5 to 10 residues in the antigen,
whereas residues distant in the primary sequence but
proximate in the folded protein constitute a conformational
epitope. In recent years, the genes encoding several
proteins with allergenicity, such as Jug r 1, have been
cloned and expressed through use of molecular biology
techniques. Despite the rapidly increasing number
of recombinant allergens, relatively few IgE-reactive B-cell
epitopes have been defined.
The specific aim of this research was to screen the
large and small subunits of the English walnut allergen
Jug r 1 for linear IgE-binding epitopes. We report the
finding of a single immunodominant IgE-binding epitope
residing in the large subunit, and we have identified 4
core amino acids necessary for this binding to occur.

METHODS 
Human sera 

The study was approved by the Human Subjects Review Committee
at the University of California, Davis. Sera from 20 walnut-allergic
patients with convincing histories of life-threatening systemic
allergic reactions to walnuts, positive ImmunoCAP assays
(Pharmacia Diagnostics, Columbus, Ohio), and evidence of IgE
against rJug r 1 (see below) were used in this investigation. Control
sera were obtained from atopic patients with no history of walnut
sensitivity and 1 patient who had life-threatening reactions to walnuts,
positive ImmunoCAP assay, and positive serum IgE
immunoblot to walnut proteins but was negative for binding to the
2S albumin.

Expression and isolation of rJug r 1 

Two protocols for isolation of rJug r 1 were used. In the first,
used to produce rJug r 1 for screening patient sera by Western blotting,
the Jug r 1 insert was subcloned into the pPROEXHT expression
vector (Gibco BRL, Invitrogen Life Technologies, Carlsbad,
Calif) and transformed into Escherichia coli BL21-CodonPlus
(Stratagene Inc, La Jolla, Calif). Cells were grown overnight at
37°C in 5 mL 2X YT broth with 50 µg/mL ampicillin and chloramphenicol.
This was then added to 500 mL 2X YT broth and grown
to an A600 of 1.0. Isopropyl thiogalactose was added to 0.5 mmol/L,
and the culture was grown at 37°C for 4 hours. Cells were pelleted,
washed with 0.02 mol/L PBS (pH 7.3), and resuspended in BRL
Lysis Buffer (Gibco BRL). Lysozyme was added to 500 µg/mL on
ice for 15 minutes; this was followed by the addition of sarkosyl to
1% w/v. The cells were sonicated, and the insoluble matter was pelleted
by centrifugation at 12,000g for 20 minutes at 4°C. Ni-NTA
resin (Gibco BRL) was added in BRL Buffer A (Gibco BRL) to the
supernatant and allowed to incubate overnight at 4°C. The resin was
washed 4 times with BRL Buffer A, and the His-tagged fusion protein
was then eluted with BRL Buffer C (Gibco BRL) containing
0.25% sarkosyl.

In the second protocol, rJug r 1 was expressed in fusion with
glutathione-S-transferase and purified after the Frangioni and Neel
protocol with slight modifications, as previously described by Teuber
et al. This glutathione-S-transferase-Jug r 1 fusion protein was
used in all dot-blot assays, as described below.

Immunoblotting of rJug r 1 

rJug r 1 samples were either subjected to SDS-PAGE and electrophoretically
transferred to nitrocellulose (NC) or blotted directly
onto the membrane. For SDS-PAGE analysis of rJug r 1, samples
were boiled for 5 minutes in sample buffer (60 mmol/L Tris-HCl,
pH 6.8, 2% SDS, 10% v/v glycerol, 0.01% w/v bromophenol blue)
and electrophoresis was carried out with an 8-mA constant current
through use of a SE600 Vertical Slab Gel Unit (Pharmacia Biotech,
Piscataway, NJ). An SDS-PAGE gel (13% monomer acrylamide
concentration) with 1 µg of protein per 4 mm was used for
immunoblotting, as previously described. Protein was transferred
to a 0.22-µm NC membrane (MSI, Westborough, Mass) overnight
at 30 V through use of a TE42 Transphor Electro-Transfer Unit
(Pharmacia Biotech). The blot was cut into 4-mm-wide strips and
blocked for 1 hour at room temperature (RT) in PBS/3% nonfat dry
milk/0.2% Triton X-100 (TX-100). Diluted sera, 1:5 v:v in the
blocking buffer, were added to the strips and incubated overnight at
RT. The strips were then washed for 20 minutes 3 times in
PBS/0.01% TX-100 and incubated overnight at RT with equine
polyclonal 125I-antihuman IgE (Hycor Biomedical Inc, Garden
Grove, Calif) diluted 1:5 in the nonfat milk buffer. The strips were
washed for 20 minutes 3 times and exposed to Kodak Biomax x-ray
film (Kodak, Rochester, NY) at -70°C for 48 hours. All incubations
were carried out with rocking.

For dot-blot assays, a graphite pencil was used to circumscribe 4
mm × 8 mm ellipses on a dry 0.4-µm NC membrane (Schleicher &
Schuell, Keene, NH). The NC was incubated in distilled-deionized
water on a rocking table for 5 minutes and allowed to air-dry before
each dot was loaded with 0.11 µg of rJug r 1 in 2 µL distilled-deionized
water. The antigen-loaded NC was then rinsed in distilled-deionized
water as described above, placed protein-side-up on 3MM paper
(Whatman Intl Ltd, Maidstone, United Kingdom), and dried under a
60-W light until all moisture was removed. Once dry, dots were
rinsed for 2 minutes in TBS-T (Tris-buffered saline solution [TBS; 20
mmol/L Tris, 137 mmol/L NaCl, pH 7.6] containing 0.2% Tween-20),
blocked for 1 hour at RT in TBS-T containing 2% BSA (Sigma,
St Louis, Mo), and then incubated overnight at 4°C with sera diluted
1:40 in TBS-T. Membranes were then washed once for 15 minutes
and 3 times for 5 minutes in TBS-T before being incubated for 1 hour
at RT with horseradish peroxidase-labeled goat-antihuman IgE
(Biosource International, Camarillo, Calif) diluted 1:2000 in TBS-T.
Washing in TBS-T was repeated as described above, and the reactive
dots were identified after (1) a 5-minute incubation in Enhanced
Chemiluminescence Plus (Amersham Pharmacia, Piscataway, NJ),
prepared according to the manufacturer's instructions, and (2) subsequent
exposure to Kodak X-OMAT x-ray film.

Solid-phase peptide (SPOTs) synthesis 

On the basis of the published amino acid sequence of Jug r 1 and
our unpublished data determining the subunit cleavage sites, twenty-five
overlapping 13-amino acid peptides, offset by 3 amino acids,
were synthesized; these corresponded to the entire 64-amino acid
length of the large subunit (Fig 1, A) and the 26-amino acid length
of the small subunit (Fig 1, B). Additional peptides bearing the target
epitope (QGLRGEEMEEMV), mutated forms of this peptide,
and a non-IgE-binding peptide (LSQRGLQSSSV) were also synthesized.
Thirteen versions of the target peptide were created
through a single-site alanine substitution at each position along the
amino acid sequence, and 6 mutated peptides were synthesized
through use of multiple alanine substitutions (Table 1).
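
The peptide design described above (overlapping 13-residue windows offset by 3 residues, plus single-site alanine substitutions of the target peptide) can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the function names `tile_peptides` and `alanine_scan` and the 20-residue demo sequence are hypothetical, and the rule used here for the final glycine-padded window is an assumption, so peptide counts produced by this sketch need not match those reported for Jug r 1.

```python
def tile_peptides(sequence, length=13, offset=3, pad="G"):
    """Split a sequence into overlapping windows of equal length.

    The last window is padded at its C-terminal end with `pad` when the
    sequence runs out; this padding rule is an assumption of the sketch.
    """
    if length <= 0 or offset <= 0:
        raise ValueError("length and offset must be positive")
    peptides = []
    start = 0
    while True:
        window = sequence[start:start + length]
        peptides.append(window.ljust(length, pad))
        if start + length >= len(sequence):
            break  # this window already reaches the C-terminus
        start += offset
    return peptides


def alanine_scan(peptide):
    """Return one variant per non-alanine position, with that residue set to alanine."""
    return [peptide[:i] + "A" + peptide[i + 1:]
            for i, residue in enumerate(peptide) if residue != "A"]


# Hypothetical 20-residue sequence, used only to show the tiling.
demo = "ACDEFGHIKLMNPQRSTVWY"
tiles = tile_peptides(demo)
print(tiles)

# Target peptide copied as printed in the text above.
target = "QGLRGEEMEEMV"
scan = alanine_scan(target)
print(scan[:4])
```

Positions that are already alanine are skipped in this sketch because substituting alanine for alanine would only reproduce the parent peptide.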

Peptides were synthesized on derivatized cellulose sheets through
use of 9-fluorenylmethoxycarbonyl-derived (Fmoc) amino acids, as
described by the manufacturer (Genosys Biotechnologies, Inc, The
Woodlands, Tex). Briefly, cellulose membranes containing free
hydroxy groups were esterified with an Fmoc-amino acid dissolved
in 1-methyl-2-pyrrolidinone. The coupling reaction was followed by
washing in N,N-dimethylformamide (DMF), and the amino acids
were acetylated with acetic anhydride. The membranes were deprotected
by washing in DMF and incubation with a solution of 20%
piperidine in DMF; this was followed by another wash in DMF.
Coupling, acetylation, and deprotection steps were repeated for each
cycle. During the final cycle, a mixture of dichloromethane, trifluoroacetic
acid, and triisobutylsilane (1:1:0.05) was used to deprotect
the acid-labile amino acid side chains.

IgE binding to solid-phase synthetic peptides 
(SPOTs analysis) 

The peptide-containing membranes were washed in TBS and 
incubated overnight at RT in blocking solution as directed by the 
FIG 1. Overlapping peptide synthesis of Jug r 1. A, Nineteen 13-amino acid peptides, offset by 3 amino acids,
were synthesized; they corresponded to the entire 64-amino acid large subunit. B, Similarly, 6 peptides were
synthesized; they corresponded to the entire 26-amino acid small subunit. Glycine residues were added to
the C-terminal end of each sequence (shaded) so that all synthesized peptides were of equal length.



manufacturer (Genosys Biotechnologies, Inc). Membranes were 
then washed in TBS-T for 10 minutes and incubated overnight at 
4"C with an individual patient's serum or pooled patients' sera dilut- 
ed 1:5 in blocking buffer (total sera:blocking buffer). This incuba- 
tion was followed by three 5-minute washes in TBS-T and an 
overnight (4''C) incubation with '^-^I-antihuman IgE (Hycor Bio- 
medical Inc, Garden Grove, Calif) diluted 1 : 10 in a mixture of PBS, 
5% nonfat dry milk, and 0.05% Tween-20. Thixe final 10-minute 
washes in PBS/0.05% Tween-20 were performed, and IgE-peptide 
reactivity was identified after a 48-hour exposure at -70^ to Kodiik 
Bloniax x-ray film. 
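
The overlapping-peptide layout described in the Fig 1 legend (13-residue peptides, offset by 3 residues, padded with glycine at the C-terminal end) can be sketched in Python as follows. The function name `overlapping_peptides`, the stopping rule (stop once a window reaches the end of the sequence), and the 26-residue demonstration sequence are assumptions made for illustration; the Jug r 1 subunit sequences are not reproduced here, and this stopping rule would not necessarily reproduce the peptide count reported for the large subunit.

```python
def overlapping_peptides(sequence, length=13, offset=3, pad="G"):
    """Tile `sequence` with windows of `length` residues, stepping by `offset`.

    A final window that runs past the end of the sequence is padded at its
    C-terminal end with `pad` (glycine) so that all peptides have equal length.
    """
    peptides = []
    start = 0
    while True:
        window = sequence[start:start + length]
        peptides.append(window.ljust(length, pad))
        if start + length >= len(sequence):
            break  # this window already reaches the end of the sequence
        start += offset
    return peptides


# Hypothetical 26-residue sequence (not the Jug r 1 small subunit).
DEMO_SEQUENCE = "ACDEFHIKLMNPQRSTVWYACDEFHI"
demo_peptides = overlapping_peptides(DEMO_SEQUENCE)
```

With these assumptions a 26-residue sequence yields 6 peptides, the last of which carries 2 padding glycines.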

Soluble epitope peptide synthesis 

A soluble form of an identified IgE-reactive peptide 
(QGLRGEEMEEMV) was synthesized by Fmoc protocols on an 
automated peptide synthesizer (Model 433A, Applied Biosystems, 
Foster City, Calif). 

Isolation and inhibition of epitope-specific IgE 

The epitope-reactive IgE was isolated from patients' sera by affinity 
chromatography. A 0.5-mL pool of patients' sera, known to react 
with the reactive synthetic solid-phase epitope peptide, was slowly 
added to a 5-mL disposable polypropylene column (Pierce Chemical 
Company, Rockford, Ill) containing cyanogen-bromide-activated beads 
(Sigma) to which 5 mg of the reactive peptide had been covalently 
coupled, as described by the manufacturer; the effluent was collected. 
Bound IgE was eluted with the addition of 0.2 mol/L glycine sulfate 
(pH 2.3), collected in a beaker containing 10 µL 1% BSA in buffered 
saline solution borate, and subsequently neutralized with 1.0 mol/L Tris. 
The column was then rinsed with buffered saline solution borate and 
the initial effluent repassed over the column. The process was repeated 
for a total of 3 times, yielding 3 eluates and the column effluent. 

To test the specificity and reactivity of the epitope-specific IgE, 
control and pooled and fractionated patients' sera were 
preincubated overnight at 4°C with different amounts (70, 7.0, and 0.7 µg) of 
the soluble epitope peptide or a soluble nonspecific peptide. The 
preincubated sera were used to probe either solid-phase IgE-reactive 
and non-IgE-reactive peptides or rJug r 1. IgE binding to the 
solid-phase synthetic peptides and dot-blotted rJug r 1 was detected 



TABLE I. Mutational analysis of the Jug r 1 IgE binding 
epitope, E1 

Amino acid sequence*    Substitution          IgE binding† 

AGLRGEEMEEMV            Q1A                   ++ 
QALRGEEMEEMV            G2A                   ++ 
QGARGEEMEEMV            L3A                   + 
QGLAGEEMEEMV            R4A 
QGLRAEEMEEMV            G5A 
QGLRGAEMEEMV            E6A 
QGLRGEAMEEMV            E7A 
QGLRGEEAEEMV            M8A 
QGLRGEEMAEMV            E9A 
QGLRGEEMEAMV            E10A                  + 
QGLRGEEMEEAV            M11A 
QGLRGEEMEEMA            V12A 
QAARGEEMEEMV            G2A,L3A               + 
QAAAGEEMEEMV            G2A,L3A,R4A 
QAAAAEEMEEMV            G2A,L3A,R4A,G5A 
QGLRGEEMEAAV            E10A,M11A             + 
QGLRGEEMAAAV            E9A,E10A,M11A 
QGLRGEEAAAAV            M8A,E9A,E10A,M11A 

*The critical core amino acid residues, located at positions 36 through 39 of 
the large subunit, and an influential glutamic acid residue at position 42 are 
indicated in bold type. Mutated residues are underlined. 
†Peptides were probed with pool 1, comprising sera from 6 
walnut-sensitive patients. 



through use of 125I-antihuman IgE and Enhanced Chemiluminescence 
Plus, respectively, as described earlier. 

RESULTS 

IgE binding to rJug r 1 

IgE from 20 walnut-sensitive patients was shown to 
bind the 20-kDa His-tagged rJug r 1 (Fig 2). Three sera 
showed only faint binding, whereas the other 17 sera 
gave moderate or strong signals. Recombinant Jug r 1 
FIG 2. IgE immunoblotting of rJug r 1. C1, Serum from a walnut- 
allergic patient that does not bind to Jug r 1; C2, serum from an 
atopic control patient not allergic to walnuts; 2Ab, the secondary 
antibody alone. 

FIG 3. IgE binding analysis of Jug r 1. A, SPOTs array depicts all 
25 overlapping 13-amino acid Jug r 1 large and small subunit 
peptides probed with pool 1, comprising sera from 6 walnut- 
sensitive patients. B, Sequence comparison of 3 adjacent 
overlapping peptides (10, 11, and 12) recognized by walnut-allergic 
sera. The amino acid sequence common to each is indicated 
(shading). 

was previously shown to completely inhibit binding to 
the native 2S albumin and to substantially inhibit binding 
of patient sera IgE to English walnut in walnut ImmunoCAP 
assays (up to 84% of measurable antiwalnut IgE); 
the folding thus appears to preserve most of the 
presumed conformational epitopes present on the native 
mature protein. 

Identification and recognition of IgE-reactive 
linear epitopes on rJug r 1 

The entire lengths of both the large and the small subunits 
of Jug r 1 were studied by probing overlapping solid-phase 
synthetic peptides with sera from 20 patients randomly 
assigned to 4 pools. Each pool recognized 3 adjacent pep- 
tides from the large subunit; 2 peptides were recognized 
very strongly (no. 11 and no. 12) and 1 peptide less so (no. 
10); no peptides were identified from the small subunit (Fig 
3, A). A common sequence, GLRGEEM, was observed in 
all 3 large-subunit peptides (Fig 3, B). A fourth partially 
overlapping sequence (no. 9 in Fig 3, A) showed a slight 
positive reaction. Some other peptides showed slight positive 
signals in some assays (eg, no. 7 in Fig 3, A) but were 
FIG 4. Differential binding of IgE from the sera of 20 walnut-allergic 
patients to the E1 epitope (+, left SPOT) and negative control 
(-, right SPOT) peptides. A, Autoradiograph shows patients' sera 
binding to the E1 peptide strongly (nos. 32, 68, 139, 152, and 191), 
moderately (nos. 38, 223, 237, 4, 218, and 172), weakly (nos. 35, 
98, 78, and 173), or not at all (nos. 42, 160, 100, 115, and 178). B, 
Longer exposure of weakly reactive sera (nos. 35, 98, 78, and 173). 
C1, Serum from a walnut-allergic patient that does not react with 
Jug r 1; C3, serum from an atopic subject not allergic to walnut. 

not reproducible, leading us to examine only the identified 
dominant linear epitope-bearing peptides. Additional peptides 
were tested in which alanine was substituted at each of 
the 12 amino acid residues of the IgE-reactive peptide, 
QGLRGEEMEEMV. In addition, peptides were tested with 
varying numbers of alanines substituted at the N-terminus 
and the C-terminus. Together, these data demonstrated that 
the core amino acids RGEE, at positions 36 to 39, and an 
additional glutamic acid residue at position 42, were 
necessary for maximum IgE binding to occur (Table I). 
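
The length of the sequence shared by adjacent reactive peptides follows from the peptide layout: with 13-residue peptides offset by 3 residues, 3 consecutive peptides have 13 - 2 x 3 = 7 residues in common, which matches the 7-residue sequence GLRGEEM. The Python sketch below works through this arithmetic; the function names and the flanking residues of the demonstration sequence are hypothetical and are not taken from Jug r 1.

```python
def shared_core_length(peptide_length, offset, n_adjacent):
    """Number of residues common to `n_adjacent` consecutive overlapping peptides."""
    return max(peptide_length - (n_adjacent - 1) * offset, 0)


def shared_core(sequence, first_start, peptide_length, offset, n_adjacent):
    """Residues present in every one of `n_adjacent` consecutive peptides.

    The shared stretch runs from the start of the last peptide to the end
    of the first peptide.
    """
    last_start = first_start + (n_adjacent - 1) * offset
    return sequence[last_start:first_start + peptide_length]


# Hypothetical flanks (N and S runs) around the reported common sequence.
DEMO = "NNNNNN" + "GLRGEEM" + "SSSSSS"
core_length = shared_core_length(13, 3, 3)
core = shared_core(DEMO, first_start=0, peptide_length=13, offset=3, n_adjacent=3)
```

The same calculation for 4 adjacent peptides gives 4 shared residues, so a fourth, partially overlapping peptide would contain only part of the common sequence.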

SPOTs containing a 12-amino acid peptide 
(QGLRGEEMEEMV) bearing the immunodominant 
epitope, designated E1, and SPOTs containing a negative 
control (-) peptide (LSQRSQQQCRQ) selected from the 
large subunit of Jug r 1 were used to test the degree of 
individual reactivity and specificity of allergic and control 
sera. Of the 20 sera tested, 5 exhibited strong recognition 
of the epitope peptide, 6 exhibited moderate recognition, 
4 exhibited weak recognition, and 5 showed no 
recognition; none reacted with the (-) peptide (Fig 4). 

Isolation and inhibition of E1-specific IgE 

To determine whether the identified epitope E1 is the 
major epitope recognized by patients' sera, we separated 
E1-specific IgE from the total antiserum and tested both 
fractions for reactivity with rJug r 1. Epitope-specific IgE 
was isolated from patient serum by passage over an E1- 
affinity column. To assure removal of anti-E1 antibodies, 
the 3 sequential eluate and the serum effluent fractions 
were first assayed against our solid-phase IgE-binding (+) 
and non-IgE-binding (-) peptides. Fig 5, A shows binding 
of IgE to the positive (E1) peptide in unfractionated 
patient serum, progressively less binding in the first 2 eluate 
fractions, and no binding in the third fraction, indicating 
complete removal of the E1-reactive antibody by the 
column. Significantly, the E1-adsorbed serum (effluent) 
also showed no reactivity to the E1 peptide, confirming 
removal of E1-specific IgE from the serum. The eluate 
and effluent fractions were then used in dot-blot assays to 
determine whether IgE antibodies to the E1 epitope 
represent the bulk of the rJug r 1-reactive IgE. As expected, 
IgE from total sera and from the first 2 eluate (anti-E1) 
fractions bound rJug r 1, whereas IgE from the third 
(anti-E1-depleted) fraction did not 
(Fig 5, B). However, there was considerable rJug r 1- 
specific IgE remaining in the effluent (anti-E1-depleted 
fraction), demonstrating the presence of additional IgE 
antibody specific for 1 or more conformational epitopes. 

To further examine the specificity of both our peptide-reactive 
fractions and unfractionated Jug r 1-reactive 
IgE, a soluble form of the epitope peptide was preincubated 
with whole patient serum as well as the E1-specific 
IgE antibodies in an attempt to inhibit their reaction 
with both our solid-phase peptides and rJug r 1. Varying 
amounts (70, 7.0, 0.7 µg) of the peptide were incubated 
with whole patients' sera before the E1 epitope (positive) 
and negative solid-phase peptides were probed. Partial to 
complete inhibition was achieved (Fig 5, C), demonstrating 
that the solid- and fluid-phase versions of the peptide 
are similarly recognized. 

The amount of soluble peptide (70 µg) needed to completely 
inhibit binding of E1-specific IgE to the solid- 
phase version of this epitope was similarly used for inhibition 
studies involving unfractionated patient serum IgE 
and affinity column-purified IgE fractions against 
rJug r 1. Inhibition was again observed in the eluate (E1 
epitope-specific) IgE fractions. These data demonstrate 
that the epitope is similarly recognized in both the peptide 
and the recombinant protein. However, IgE in both 
whole patient serum and the E1-adsorbed 
effluent fraction still reacted with rJug r 1 (Fig 5, D), again 
demonstrating the existence of additional (presumably 
conformational) IgE-reactive epitopes on rJug r 1. 

DISCUSSION 

On a global basis, walnuts rank second in tree nut 
production. In children, anaphylactic reactions to peanut and 
tree nuts are responsible for most fatal and near-fatal food- 
allergic reactions. In a recent study, 32 fatal cases of 
anaphylactic reactions to foods reported to a national 
registry were analyzed; it was determined that up to 31% were 
due to tree nuts. The severity and prevalence of some 
food-induced allergic reactions have prompted investigation 
to identify and characterize offending allergens. 
FIG 5. Isolation and inhibition of E1-specific IgE. A, Strips 
containing the E1 epitope peptide (top SPOT, +) and a negative 
control peptide (bottom SPOT, -) probed with whole patient serum 
(32), each of 3 E1 eluates (1st, 2nd, 3rd), and the effluent (ef). B, 
Strips dotted with rJug r 1 and probed as in Fig 5, A. C3, Serum 
from an atopic subject not allergic to walnut. C, Strips containing 
SPOTs as in Fig 5, A and probed with whole patient serum (32) 
containing varied amounts (in µg) of soluble E1 peptide. D, Strips 
dotted with rJug r 1 and probed as in Fig 5, B, but with added 
soluble E1 inhibitor. 



Jug r 1, a major allergen in the English walnut, has 
previously been characterized as a 2S albumin seed storage 
protein. 2S albumins are found in almost all edible 
seeds. In view of the importance of the members of this 
class of protein as food allergens and the possibility of 
cross-reactivity among class-related proteins, a 
detailed knowledge of these proteins would be valuable. 
In particular, given the critical role that allergen-specific 
IgE plays in the allergic reaction, determination of aller- 
gen-specific IgE binding epitopes — be they linear or con- 
formational — appears to be of great importance for a bet- 
ter understanding of the allergenic nature of foods and 
for possible therapeutic intervention. 
We have found a single linear IgE-reactive epitope and 
have defined its core amino acid residues. To date, no 
common structural character of linear IgE epitopes has 
been identified, but this could change as more epitope 
mapping studies are completed. A key step in the allergic 
reaction is the binding of at least 2 IgE antibodies to a 
multivalent allergen. The fact that we have found only 1 
linear epitope on Jug r 1 is unique in that all previously 
analyzed allergens have contained multiple linear IgE- 
binding sites. Although we cannot rule out the existence 
of weakly binding peptides that are undetected by 
our methods, the affinity of such peptides would be much 
lower, as indicated by the high intensity of the signal on 
the epitope E1. Our findings, in combination with the 
knowledge that allergens must be multivalent to elicit an 
allergic response, point toward the existence of 
conformational epitopes on Jug r 1 as well. Although the data 
are qualitative, it is noteworthy that all 20 patients' sera 
bound rJug r 1 (Fig 2), whereas only 15 sera bound the 
linear epitope, many of these exhibiting disproportionately 
faint binding. This too is evidence that conformational 
epitopes are relevant and key in IgE binding for 
some patients' sera. In fact, it has been estimated that 
most protein epitopes are conformational. It is worthwhile 
to note that the labeling of an epitope as "linear," 
based on the use of synthetic epitopes, is imprecise 
because the identified epitope might be a fragmented part 
of a larger, discontinuous epitope.35 

Clinically, insight into the IgE-binding epitopes of 
allergens, linear or conformational, is extremely useful. 
Identifying the immunodominant linear IgE-binding 
epitope of Jug r 1 might lead to better designs for wal- 
nut allergy therapy. Currently, there is no generally 
agreed-on treatment for IgE-mediated food allergy, and 
complete avoidance of the food is therefore 
recommended. With tree nuts such as walnuts, however, 
avoidance is often difficult because equipment used for 
food processing is often shared; the implicated allergen 
thus might be present in trace amounts or might be 
added to foods in which its presence is not expected by 
the consumer. Because tree nut allergies are rarely 
resolved, reactions can be severe, and accidental ingestion 
is almost inevitable, life for afflicted patients can be 
fraught with anxiety. 

Among new approaches to allergen immunotherapy 
and diagnosis are use of blocking peptides to inhibit the 
triggering of IgE-mediated hypersensitivity reactions, 
immunization with specific peptides to alter the type of 
immune response elicited, allergen reengineering to 
enhance protective responses, and use of defined peptides 
in diagnostic assays. A common factor for the 
application of these approaches is a required knowledge 
of the amino acids in the IgE-reactive sites of the allergen. 
We have mapped the major linear IgE epitope of Jug 
r 1 and identified the critical core amino acid residues 
within the epitope. Additional studies are planned to 
identify the conformational epitopes on this allergen. 

We thank Rich Peterson for his valuable assistance. 
Mutational analysis of the IgE-binding 
epitopes of P34/Gly m Bd 30K 



Ricki M. Helm, PhD, Gael Cockrell, BS, Cathie Connaughton, BS, C. Michael 
West, BS, Eliot Herman, PhD, Hugh A. Sampson, MD, Gary A. Bannon, PhD, 
and A. Wesley Burks, MD  Little Rock, Ark, Beltsville, Md, and New York, NY 



Background: Peanuts and soybeans are 2 foods that have been 
shown to be responsible for many atopic disorders. Because of 
their nutritional benefit, soybean proteins are now being used 
increasingly in a number of food products. Previous studies 
have documented multiple allergens in soybean extracts, 
including glycinin, β-conglycinin, and the P34/Gly m Bd 30K 
protein. 

Objective: Our overall goal was to identify soybean-specific 
allergens to begin to understand molecular and immunochemical 
characteristics of legume proteins. The specific aim of the 
current investigation was to identify the essential amino acid 
residues necessary for IgE binding in the 5 distinct 
immunodominant epitopes of P34/Gly m Bd 30K. 

Methods: Serum IgE from 6 clinically sensitive soybean-allergic 
individuals was used to identify P34/Gly m Bd 30K in the 
native and single amino acid substituted peptides with use of 
the SPOTs peptide synthesis technique to determine critical 
amino acids required for IgE binding. 

Results: The intensity of IgE binding and epitope recognition 
by serum IgE from the individuals varied substantially. With 
use of serum from 6 clinically soybean-sensitive individuals, 2 
of the 5 immunodominant epitopes could be mutagenized to 
non-IgE-binding peptides. 

Conclusions: Single-site amino acid substitution of the 5 
immunodominant epitopes of Gly m Bd 30K with alanine 
revealed that IgE binding could be reduced or eliminated in 
epitopes 6 and 16 in the serum obtained from 6 soybean-sensitive 
patients. (J Allergy Clin Immunol 2000;105:378-84.) 

Key words: Soybean allergen, P34/Gly m Bd 30K, B-cell epitopes 

Soybeans and peanuts are both members of the legume 
plant family and they share common antigenic fractions. 
Patients allergic to one member often have IgE antibod- 
ies that react to other members of the legume family. 
Although cross-reacting proteins among soybeans and 
Abbreviations used 
DBPCFC: Double-blind, placebo-controlled food challenge 
Gly m Bd: Glycine max band 
rP34: Recombinant P34 protein 
TBST: TRIS-buffered saline solution with 0.05% Tween 20 



peanuts are immunochemically frequent, clinical responses 
to more than one are not common. A comparison of 
IgE immunoblots to soybean extracts with common 
legumes showed that soybean extracts shared several 
common as well as unique proteins with other members 
of the legume family. 

Previous studies have documented multiple allergens 
in soybean extracts, including glycinin, β-conglycinin, 
and the Glycine max band (Gly m Bd) 30K protein. 
Ogawa et al initially described a soybean band designated 
Gly m Bd 30K as a 7S soybean protein allergen, 
assigning it as the major allergenic protein in soybean 
extracts that bound specific IgE in the serum of 65% of 
patients with atopic dermatitis in Japan. The first 15 
amino acids of Gly m Bd 30K were identical to those of 
a 34-kd soybean seed storage vacuole protein identified 
by Kalinski et al. P34 is an outlying member of the 
papain superfamily that appears to be a Pseudomonas 
defense protein. 

The most efficient way to avoid soybean allergy is 
strict avoidance of food products containing allergenic 
proteins. Several methods are being conducted to reduce 
the allergenicity of allergens, including enzymatic meth- 
ods, physical treatment, or development of soybean 
seeds that lack the allergenic protein. A combination of 
both genetically selected soybean cultivars and physico- 
chemical separation methods appears to be the most 
promising method. With this in mind, our goal was to 
identify soybean allergens and specifically determine the 
presence of unique soybean protein epitopes in their 
amino acid structure that could potentially be mutagenized 
to non-IgE-binding proteins. The prevalence of the 
hypersensitivity reactions to the allergen identified as 
Gly m Bd 30K and its homology to the previously identified 
protein P34, a constituent of protein storage vacuoles 
in soybean seed cotyledons, made this protein an 
ideal candidate for identification and characterization of 
the major IgE-binding epitopes. The specific aim of the 
current investigation was to identify the essential amino 
acid residues necessary for IgE binding in the 5 distinct 
immunodominant epitopes in this major soybean allergen. 
In this article we report the localization of specific 
amino acids that can be substituted to render 
immunodominant epitopes 6 and 16 to non-IgE-binding peptides. 

MATERIAL AND METHODS 
Serum IgE 

Twelve patients with atopic dermatitis and a positive immediate 
skin test response to soybean extracts were confirmed to be sensitive 
to soybean protein by double-blind, placebo-controlled food 
challenge (DBPCFC). The individual sera were diluted 1:20 (vol/vol) in 
TRIS-buffered saline solution (25 mmol/L TRIS-hydrochloric acid, 
pH 7.2, 150 mmol/L sodium chloride) with 0.05% Tween 20 (TBST) 
for immunoblot analysis and 1:5 (vol/vol) in TBST with blocking 
solution as recommended by the manufacturer (Genosys 
Biotechnologies, The Woodlands, Tex) for mapping of IgE-binding regions 
and epitopes of the Gly m Bd 30K allergen. The Human Advisory 
Committee approved all studies at the University of Arkansas for 
Medical Sciences and Mt Sinai School of Medicine. 

Soybean extracts 

Preparations of defatted soybean flakes were obtained from 
commercial sources (Dr Tun Brooks, Ross Laboratories, Columbus, 
Ohio). Defatted (Soxhlet extraction with petroleum ether) soybean 
flakes were ground to a fine powder in a mortar and pestle and 
extracted (1:20 wt/vol) in 20 mmol of sodium phosphate, 1 mmol of 
sodium chloride, pH 12, overnight at 4°C. 

A clone designated P34, representing the protein seed storage 
vacuole protein, was used as a source of recombinant protein. PCR 
techniques were applied to isolate the protein-coding region of this 
clone. This portion was then subcloned into the pET-24(b)+ vector 
for expression of recombinant protein in Escherichia coli. The 
protein was expressed with a 6x-his tag on its carboxyl terminus. The 
6x-his tag allowed the purification of full-length protein by nickel 
resin column affinity chromatography. The purified protein was 
characterized by immunoblot analysis with serum IgE from 
soybean-sensitive individuals. Results indicate that the recombinant 
P34 protein was recognized with a similar IgE-binding intensity as 
the native protein, suggesting that the recombinant protein can be 
used for future studies. The protein was desalted into PBS, 
concentrated (4.2 mg/mL), filter sterilized, and stored for future use. 

SDS-PAGE/Western Immunoblot analysis 

SDS-PAGE was performed according to the method of Laemmli. 
All gels were composed of 12.5% acrylamide resolving gel and 4% 
acrylamide stacking gel. Electrophoretic transfer of proteins and 
immunoblotting were performed according to standard procedures. 
The blots were blocked with TBST with 1% BSA for 60 minutes before 
incubation with antibodies diluted in TRIS-buffered saline buffer for 
at least 12 hours at 4°C or for 4 hours at room temperature. Primary 
IgE antibody was detected with radiolabeled anti-IgE antibody (Sanofi 
Pasteur Institute, Chaska, Minn) diluted 1:5 (vol/vol) in TRIS-buffered 
saline buffer. Blots were washed 3 times in buffer, air dried, and 
exposed to Kodak X-Omat x-ray film at -70°C for 18 to 144 hours and 
developed in an automatic developer (Konica SRX-101). 

IgE binding of mutagenized 
immunodominant epitopes 

The immunodominant epitopes of P34 recognized by 4 clinically 
soybean-sensitive individuals were used to generate individual 
peptides for IgE-binding studies. Substitution of an alanine residue 
for each of the wild-type amino acids in the sequence was used to 
determine which amino acids were essential for IgE binding. 
Individual peptides were synthesized on a cellulose membrane 
containing free hydroxyl groups with use of Fmoc amino acids according 
to the manufacturer's instructions (Genosys Biotechnologies). 
Synthesis of each peptide was started by esterification of an Fmoc 
amino acid to the cellulose membrane. After washing, all residual 
amino functions on the sheet were blocked by acetylation to render 
the cellulose membrane unreactive during subsequent steps. Each 
additional Fmoc amino acid was esterified to the previous one by 
this same process. After addition of the last amino acid in the 
peptide chain, the amino acid side chains were deprotected with a 
mixture of dichloromethane-trifluoroacetic acid-tri-isobutylsilane 
(1:1:0.05, vol/vol), followed by treatment with dichloromethane 
and washing with methanol. Membranes containing synthesized 
peptides were washed with TRIS-buffered saline solution and then 
incubated with the Genosys blocking solution overnight at room 
temperature. The membranes were probed immediately with serum 
IgE as described above or stored at -20°C until used. IgE binding 
was determined by incubating cellulose membranes containing the 
synthesized peptides with serum from individuals with documented 
soybean hypersensitivity and, as control, serum from an individual 
with high-titer IgE with no known food allergy. Primary antibody 
was detected with radiolabeled anti-IgE as described for 
autoradiographs of SDS-PAGE/Western IgE immunoblot analysis. 

RESULTS 

SDS-PAGE of soybean extracts and 
recombinant P34 

Fig 1 shows a representative profile of an immunoblot 
of IgE binding to a crude soybean extract and to 
recombinant P34 (rP34) (lanes 2 and 4, respectively). The IgE 
binding is most intense at the 30-kd rP34 protein band. 
The IgE immunoblot sensitivity reveals radiographic 
binding at both higher and lower molecular weight bands 
that are not discernible in the Coomassie blue-stained 
photograph. These bands are evident in the original 
stained gel and represent complexes and breakdown 
products of the rP34 molecule common to many 
recombinant proteins isolated under these conditions. Control 
serum from nonleguminous-sensitive patients did not 
bind to any of the bands from our rP34 material nor did 
serum IgE from soybean-sensitive individuals bind to 
extracts of E coli in these regions (data not shown). 

IgE binding to Immunodominant epitopes 

Epitopes 1, 6, 13, 15, and 16 were previously identified as 
immunodominant, on the basis of the prevalence of serum IgE 
binding, with use of serum from 4 clinically 
soybean-sensitive individuals. In the current investigation 
we were able to add data from an additional 8 
soybean-sensitive individuals. To determine the extent of 
patients' serum IgE binding to the immunodominant 
epitopes, each epitope was synthesized as a single spot 
on SPOTs membranes and incubated with serum from 
individual patients. The immunodominant epitopes are 
listed in Fig 2 with a representative blot of 6 patients' 
serum IgE-binding pattern. The intensity of IgE binding 
and epitope recognition by serum IgE from the individuals 
varied substantially. For example, serum IgE from 
FIG 1. SDS-PAGE IgE immunoblot analysis of crude soybean 
extract and rP34. Lane 1, Molecular weight markers. Lane 2, IgE 
immunoblot autoradiograph of crude soybean extract. Lane 3, 
Coomassie blue-stained rP34 protein. Lane 4, IgE immunoblot 
autoradiograph of rP34 protein. 



patients 5 and 3 bound all 5 epitopes with the same inten- 
sity. Patients 7, 8, and 1 demonstrated more intense stain- 
ing to epitope 15, whereas serum from patient 3 recog- 
nized epitope 13 as the predominant epitope. This 
difference in intensity and binding was characteristic of 
serum IgE from all 12 patients tested. Because the intensity 
of the binding varied, we chose to test patient serum 
against the 5 immunodominant epitopes. Table I shows 
a summary of the 12 patients' serum IgE that were tested 
for binding to the 5 immunodominant epitopes. We 
had 2 patients who did not recognize any of the 5 
immunodominant epitopes and 5 who recognized all the 
epitopes, and the remaining bound from 2 to 4 of the 5 
TABLE I. Summary of P34 immunodominant epitope mapping by serum IgE from soybean-sensitive individuals 

             P34 epitope binding 
Patient      Epitope 1    Epitopes 6, 13    Epitopes 15, 16 

1*           +            ± + 
2            +                              + ± 
3*           +            ± +               + + 
4            +            +                 + + 
5*           +                              + + 
6                         ± +               + + 
7*           +            + +               + + 
8*           +            ± ± 
9                         + +               - + 
10 
11 
12                        + +               + 

+, Positive binding; ±, marginal binding; -, negative binding. 
* Selected for mutagenized epitope mapping. 




TABLE II. Mutagenesis of P34/Gly m Bd 30K epitope 1 

SPOT No.    Mutagenized epitope 1    Position change 

1           ALVLLLFSLL               F1A 
2           FAVLLLFSLL               L2A 
3           FLALLLFSLL               V3A 
4           FLVALLFSLL               L4A 
5           FLVLALFSLL               L5A 
6           FLVLLAFSLL               L6A 
7           FLVLLLASLL               F7A 
8           FLVLLLFALL               S8A 
9           FLVLLLFSAL               L9A 
10          FLVLLLFSLA               L10A 
11          FLVLLLFSLL               Native 

Note: If an alanine residue resided in mutagenized location, glycine was 
used as substituted amino acid. 

immunodominant epitopes. This is in agreement with 
our initial results in that there appears to be a great 
amount of diversity in the immunodominant IgE-binding 
epitopes. 

IgE binding to mutagenized 
immunodominant epitopes 

To determine the amino acids that are critical to IgE 
binding of the 5 immunodominant epitopes, we mutage- 
nized the amino acid sequences by substituting alanine at 
each position and compared the IgE binding to native 
(wild-type) epitope. Table II depicts the position change 
for epitope 1. Each epitope was modified in this manner 
in a series of replicates for analysis with IgE from 6 of 
the P34-sensitive individuals. Fig 3 shows the results 
from 3 of these individuals. 
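
The layout of Table II, including the rule stated in its note that glycine is used where the native residue is already alanine, can be sketched in Python as follows. The function name `mutagenized_spots` is hypothetical, and the 3-residue peptide "KAL" used to exercise the glycine rule is an invented example, since epitope 1 contains no alanine.

```python
def mutagenized_spots(epitope):
    """Return (spot_no, peptide, change) rows laid out as in Table II.

    One substitution is made per position and the native peptide is placed
    last. Alanine is the default substitute; a native alanine is replaced
    by glycine, following the note to Table II.
    """
    rows = []
    for i, residue in enumerate(epitope, start=1):
        substitute = "G" if residue == "A" else "A"
        peptide = epitope[:i - 1] + substitute + epitope[i:]
        rows.append((i, peptide, f"{residue}{i}{substitute}"))
    rows.append((len(epitope) + 1, epitope, "Native"))
    return rows


EPITOPE_1 = "FLVLLLFSLL"  # native epitope 1 sequence from Table II
epitope_1_rows = mutagenized_spots(EPITOPE_1)
glycine_rule_rows = mutagenized_spots("KAL")  # hypothetical peptide containing alanine
```

For epitope 1 this yields 11 rows, 10 substituted peptides followed by the native sequence.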

Serum IgE from patient 12 recognized only epitopes 6, 
13, and 16. Alanine substitutions in epitope 6 at positions 
6 and 7 and at position 7 in epitope 16 showed reduced 
IgE binding. Alanine substitutions at positions 1-6 and 8 
showed increased and position 7 reduced IgE binding 
with this patient*s serum IgE to epitope 16. Epitope 13 
FIG 2. Autoradiograph of patient serum IgE binding to 5 immunodominant 
P34/Gly m Bd 30K epitopes (epitopes 1, 6, 13, 15, and 16). 

FIG 3. IgE binding to mutagenized P34/Gly m Bd 30K epitopes (amino acid 
positions 1 through 11). 



could not be mutagenized to a non-IgE-binding peptide 
with alanine substitutions at any position in the peptide 
with serum from patient 12. 

Serum IgE from patients 8 and 3 recognized all 5 
epitopes. Consistent with patient 12, serum IgE from both 
patients failed to bind when alanine was substituted at positions 6 and 7 of epitope 
6. Serum from patient 8 demonstrated that an alanine 
substitution did not significantly alter the IgE binding in 
the remaining sites of the epitopes investigated. The 
autoradiograph for patient 3, in addition to reduced IgE 
binding at position 6 and 7 of epitope 6, revealed both 
increased and decreased levels of radiostained intensity 
in the remaining sites of the immunodominant epitopes. 

Table III summarizes the extent of IgE binding to 
mutagenized epitopes with the 6 individuals we tested. 
Notable in this table is the consistency with which we 
could or could not mutagenize IgE binding by alanine 
substitution. Epitopes 1, 13, and 15 could not be mutag- 
enized to a non-IgE-binding peptide, whereas epitope 6 
could be altered to a non-IgE-binding peptide by an ala- 
nine substitution at positions 6 and 7 in the amino acid 
sequence. Epitope 16 was more variable and could be 
mutagenized to a non-IgE-binding peptide at position 7 



for 4 patients (patients 4, 5, 1, and 12) and at positions 5 
and 6 for 2 patients (patients 3 and 8). This suggested that 
we could mutagenize epitopes 6 and 16 to non-IgE-bind- 
ing peptides by substituting alanine at specific sites in the 
sequence of rP34 for patients 3, 8, and 12. 
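
The single-site alanine substitution strategy described above can be sketched in a few lines. This is only an illustration of how the set of substituted peptides is enumerated; the function name and the 10-residue peptide are hypothetical, and the peptide is not one of the P34/Gly m Bd 30K epitopes, whose sequences are not given in this section.

```python
def alanine_scan(peptide):
    """Return (position, variant) pairs, one per residue replaced by alanine.

    Positions are 1-based. Residues that are already alanine are skipped,
    because substituting them would reproduce the wild-type peptide.
    """
    variants = []
    for i, residue in enumerate(peptide):
        if residue == "A":
            continue
        variants.append((i + 1, peptide[:i] + "A" + peptide[i + 1:]))
    return variants


# Hypothetical peptide used for illustration only (an assumption, not source data).
wild_type = "GSKDELRTQW"
for position, variant in alanine_scan(wild_type):
    print(position, variant)
```

Each variant would then be synthesized and probed with patient serum, and the binding at each position compared with the wild-type peptide.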

DISCUSSION 

Five immunodominant IgE-binding epitopes were previously
identified with use of serum IgE from 4 soybean-sensitive
individuals by B-cell epitope mapping with
overlapping peptides for the soybean protein identified
as Gly m Bd 30K (Gly m Bd 30K/P34), a plant thiol protease.
This protein has been identified as the major soybean
allergen with a frequency of 65% in patients with
atopic dermatitis with food allergy. In this study we
report the results of mutagenizing the 5 immunodominant
epitopes on the P34/Gly m Bd 30K soybean protein for the
identification of amino acids critical for IgE binding. The
single-site amino acid substitution of the 5 epitopes
revealed that IgE binding could be reduced or eliminated
depending on the serum of individual patients.

TABLE III. Summary of 6 patients' sera used to map mutagenized immunodominant epitopes

Columns: Epitope; amino acid position (alanine) 1 to 10; Weight.
Rows: epitopes 1, 6, 13, 15, and 16 for each of patients 12, 9, 4, 3, 5, and 8.
(The +/- cell entries lost their column alignment in extraction and are not reproduced.)

Adverse reactions to various food products are a common
problem faced by primary care physicians. Among
the numerous reactions ascribed to soy products, hypersensitivity,
in contrast to enterocolitis, is described as an
abnormal immunologic response in which IgE is the principal
immunoglobulin. In most studies the diagnosis of
soy allergy has been based on clinical evaluation or case
histories reported by parents without substantial scientific
diagnostic criteria. In a recent study 1% of children with
allergic disease had documented soy allergy and all of
them had atopic dermatitis. In related studies with a 3- to
5-year follow-up, the frequency of soy reactions was
almost nonexistent, with other studies proclaiming a
more transient adverse soy reaction. The reports suggest
a wide discrepancy in soy reactions, ranging from
nonexistent to transient and documented soy reactions.
This discrepancy is manifested in patient selection criteria
and the method used to document soy reactions. Our study
involved patients with soy reactions clinically proved by
DBPCFC. After diagnosis of a food allergy, the only



proved therapy is elimination of the offending allergen. 
For those individuals with soybean allergy a more direct 
approach to treatment of this food allergy is desirable. 

Although the incidence of allergy to soybean proteins 
is quite low in comparison with other major food pro- 
teins, the gradually increasing consumption of soybean 
products makes the identification and characterization of 
major soy allergens a focus for investigation. Herian et
al identified 3 different patterns of immunoblots in adult
patients with soybean allergy on the basis of molecular
weights. Serum-bound IgE patterns were resolved into
those individuals who had IgE-recognizing proteins with
molecular weights of 50 to 60 kd, 20 kd, or 14 kd.
Awazuhara et al identified 5 proteins with molecular
weights ranging from 20 to 78 kd in the whey and globulin
fraction of soybean extracts. None of these proteins
had an IgE-binding frequency greater than 50%. In studies
with 8 serum samples from soy protein-allergic
children, immunoblots to crude soy proteins demonstrated
that IgE- and IgG-specific antibodies to crude soy
extracts were elevated in patients with positive
DBPCFC, but none of the soy fractions investigated were
clearly more antigenic. In a study carried out in Japan,
SDS-PAGE and immunoblotting analyses revealed that
19% of patients with atopic dermatitis possessed IgE
antibodies specific for soybean proteins. At least 16 proteins
were recognized, with detection frequencies ranging
from 1.4% to 2.9% for 2S and 11S components to as
high as 65% for the 7S-globulin fraction that contains
Gly m Bd 30K. The remaining protein components had
IgE binding frequencies of less than 30%. The variability
in the number of epitopes recognized reflects both the
polyclonal nature of the immune response and the subject
variability in epitope recognition.

The development of methods to reduce the allergenicity
of soybean proteins is required because no other food
material is used in as many foods as soybean throughout
the world. Hidden soybean allergens in processed foods
have been shown to provoke severe reactions in sensitive
individuals. A combination of both genetically selected
soybean cultivar mutants lacking the α- and α'-subunits
of conglycinin and physicochemical separation methods
to remove Gly m Bd 30K from soy milk has been
described. Although this is a very simple method, efficient
industrial application procedures have not been
developed to produce a nonallergenic or very-low-allergenic
soybean protein source.

Another method to reduce the allergenicity of soybean
proteins would be to introduce genes that would
produce hypoallergenic versions of the allergens.23-25
The intensity of binding to wild-type peanut allergen
was routinely high, whereas binding to mutated epitopes
was markedly decreased at specific sites in each of the
peanut epitopes. In contrast, the IgE binding to wild-type
P34/Gly m Bd 30K epitopes did not show the same
degree of intense binding as evidenced by autoradiographic
analysis, nor could we mutate each of the epitopes
to non-IgE-binding peptides. Peanut sensitivity,
which is a lifelong, life-threatening hypersensitivity, is
a more severe food allergy leading to anaphylactic
episodes. Soybean hypersensitivity, which is usually
outgrown by age 5 years or less, is associated more with
gastrointestinal symptoms and normally does not lead to
an anaphylactic episode.

The elucidation of the IgE-binding epitopes on
P34/Gly m Bd 30K, the major allergen of soybean in
atopic dermatitis identified by Ogawa et al, and the
determination of amino acids essential to IgE binding
could provide the information necessary to alter the gene
responsible for encoding a non-IgE-binding protein.
Although other proteins are involved in soybean hypersensitivity,
we chose the P34 molecule as our first soybean
allergen to mutagenize and to test for insertion into
the soybean genome. This approach will require that we
suppress the native gene and determine that our altered
gene product will retain its native function and properties
in the soybean plant and seed. We propose to test our
hypothesis with use of in vitro transformation techniques
to determine the potential of producing a soybean product
with reduced P34 serum IgE binding. Our overall
approach is to prepare moderately modified proteins by
site-directed amino acid substitutions of soybean allergens
and use of cosuppression of native allergens to produce
a hypoallergenic soybean plant.

Universität Wien

1. INTRODUCTION

In the last few years, the use of recombinant DNA techniques for the characterization
of atopic allergens offered new aspects for diagnosis and therapy of Type I allergies
(1). It has made it possible to produce large quantities of well characterized
"wild-type" recombinant allergens, or first-generation recombinant products, and many
of them are now being developed for diagnosis and possibly therapy of Type I allergies.
For instance, recombinant Asp f 1 expressed in E. coli was successfully used for serologic
and clinical diagnosis of A. fumigatus allergy (2). It has been demonstrated that the
use of two recombinant birch pollen allergens, Bet v 1 and Bet v 2 (profilin), allows accurate
in vitro (ELISA, immunoblots) and in vivo (skin prick test, intradermal test) diagnosis
of birch pollen allergy (3,4). In addition, recombinant Bet v 1 could be efficiently
used for identifying food cross-sensitization induced by Bet v 1-related proteins (4).
These studies demonstrate that recombinant allergens are adequate tools for in vivo and
in vitro allergen-specific diagnosis, which might be considered as an important step towards
allergen-specific therapy. Presently, specific immunotherapy is performed using
natural allergen extracts that may contain, besides the desired allergen, other unwanted
components.

In addition to their use for characterization of allergens, recombinant DNA techniques
also offered the unique possibility of arbitrarily altering the nucleotide sequence of
a gene and, subsequently, the sequence of the encoded protein in order to produce novel
"mutant" proteins, or second-generation recombinant products, displaying altered properties.
Previously we have shown that isoforms of Cor a 1, the major hazel pollen allergen,
displayed striking differences in their ability to bind IgE from allergic patients (5). Since
these isoforms showed high amino acid sequence similarity, we speculated that the differences
in IgE-binding were a result of sequence dissimilarities. In particular, the exchange
at position 10 from a threonine in Bet v 1a and Cor a 1/11 isoform to proline in Cor a 1/16
isoform seemed to correlate with its lower IgE-binding capacity. In this study, we have
tested this hypothesis using a PCR-based site-directed mutagenesis approach to produce a
single amino acid exchange in the Cor a 1/16 isoallergen.



2. MATERIALS AND METHODS 

2.1. DNA Constructs

The cDNAs coding for Bet v 1a (6) and Cor a 1/16 (5) were cloned in the pMW175
expression vector (7). The amino acid exchange at position 10 (Pro->Thr) in Cor a 1/16
was engineered by PCR-mediated mutagenesis using the following primers:

NcoI mutagenic primer 5'-GGG CCATGG GTGTTTTCAATTACGAGGTTGAGACCACCTCCGTT-3',
base exchanged indicated in bold; NcoI site is underlined.

EcoRI primer 5 '-CCC GAATTC TTAGTTGTATTCAGCAGAGTGTGCGAAO \ 
EcoRI site is underlined. 

The PCR was performed with 1 ng template (pMW175/Cor a 1/16 construct) and
1 µM of each primer, using 30 cycles of 1 min denaturation at 95°C, 1 min annealing at
42°C, and 1 min extension at 72°C. The PCR product was digested with NcoI and EcoRI
and subcloned in the pMW175 expression vector. The resulting plasmids were used to
transform competent E. coli BL21 cells. All PCR amplified products were sequenced according
to the dideoxy chain termination method (8).
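
As a quick consistency check on the mutagenic primer, its coding portion can be translated with the standard genetic code. The sketch below assumes that the hyphens in the printed primer are typesetting artifacts and that residue numbering starts after the initiator methionine; the variable names are illustrative only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# NcoI mutagenic primer from the text, spaces and hyphens removed (assumption).
PRIMER = "GGGCCATGGGTGTTTTCAATTACGAGGTTGAGACCACCTCCGTT"

start = PRIMER.index("ATG")            # ATG lies inside the NcoI site CCATGG
protein = translate(PRIMER[start:])    # includes the initiator Met
mature = protein[1:]                   # numbering used in the text starts at Gly-1
codon_10 = PRIMER[start + 30:start + 33]

print(protein)
print("residue 10:", mature[9], "codon:", codon_10)
```

Under these assumptions residue 10 is encoded by ACC (threonine), in line with the Pro->Thr exchange the primer is meant to introduce.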

2.2. Expression of Cor a 1/16, Cor a 1/16T10, and Bet v 1a

E. coli BL21 cells containing the pMW175/Bet v 1a, pMW175/Cor a 1/16, and
pMW175/Cor a 1/16T10 plasmids were grown and expression of the recombinant allergens
induced by adding IPTG to a final concentration of 1 mM. After incubation at 37°C
for 6 h, cells were harvested by centrifugation and lysed by repeated freeze-thaw cycles.

2.3. SDS-PAGE and Immunoblots

E. coli lysates of recombinant Cor a 1/16, Cor a 1/16T10, and Bet v 1a or purified
proteins were analyzed by SDS-PAGE according to the method of Laemmli (9), using
15% acrylamide gels. Proteins were visualized by staining with Coomassie Brilliant Blue
R-250. For immunoblot analysis, proteins were separated by 15% SDS-PAGE and electroblotted
onto nitrocellulose membranes. Immunoblots using a monoclonal anti-Bet v 1,
BIP 1, were performed as described previously (10). IgE immunoblots were carried out
using sera from birch pollen allergic patients. Bound IgE was detected using 125I-rabbit
anti-human IgE. E. coli lysates harboring the plasmid without insert were used as a control.
In all experiments, reagents and cell lysates were from identical batches and were
used in the same concentrations. Autoradiography was performed at -70°C for 12-48 h
with intensifying screens.

2.4. Purification of Recombinant Allergens

rBet v 1a, rCor a 1/16, and rCor a 1/16T10 proteins were purified from crude bacterial
lysates by chromatofocusing and reversed-phase HPLC (10). Purified proteins were
analysed by SDS-PAGE according to the method of Laemmli (9) and visualized by staining
with Coomassie Brilliant Blue R-250.



2.5. Trypsin Treatment and Matrix-Assisted Laser Desorption/
Ionization Mass Spectrometry (MALDI-MS) of Cor a 1/16 and
Cor a 1/16T10 Proteolytic Fragments

Purified Cor a 1 proteins (100 µg in distilled water) were heated for 20 min at 95°C
and diluted with an equal volume of 0.2 M NH4HCO3. One microgram of trypsin was
added and the mixture incubated at 37°C for 2 h. Afterwards, trypsin was added again,
and incubation continued for an additional 4-h period. The reaction was stopped by adding
1/10 vol of trifluoroacetic acid and dried in vacuo. The resulting peptide mixtures were
subjected to MALDI-MS analysis using the HP G2025A system equipped with a nitrogen
laser.
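
The fragments expected from such a digest can be predicted in silico. The following is a minimal sketch that assumes the commonly used simplified trypsin rule (cleavage C-terminal to Lys or Arg, except when the next residue is Pro) and complete digestion; the function name is illustrative, and the input is the N-terminal stretch of wild-type Cor a 1/16 corresponding to fragments T1 to T4 of Table 1.

```python
def tryptic_digest(sequence):
    """Split a protein sequence after K or R unless the next residue is P."""
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])  # C-terminal peptide without K/R
    return peptides


# N-terminal residues of wild-type Cor a 1/16 (fragments T1-T4 in Table 1).
n_terminal = "GVFNYEVETPSVISAARLFKSYVLDGDKLIPK"
for peptide in tryptic_digest(n_terminal):
    print(peptide)
```

Real digests also contain missed cleavages, so observed fragments can span two or more of the predicted peptides.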



2.6. T-Cell Proliferation Assays

Isolation of Bet v 1-specific T-cell clones from the peripheral blood of birch pollen
allergic patients (as indicated by typical case history, positive skin tests, and positive
RAST) and proliferation assays were done as previously described (11).




Figure 1. PCR mutagenesis of the Cor a 1/16 cDNA. The 5' NcoI mutagenic primer and the 3' EcoRI primer were
used in a PCR with the Cor a 1/16 cDNA. Diagram steps: PCR amplification, cloning via the NcoI and EcoRI sites,
sequence analysis.

Figure 2. Expression, purification and immunoblot analysis of Cor a 1/16T10 mutant protein. (a), Coomassie-stained
15% SDS-polyacrylamide gel of lysates of E. coli BL21 host strain containing Cor a 1/16, Cor a 1/16T10,
and Bet v 1a pMW175 expression plasmids. Control, host strain BL21 lysate containing the pMW175 expression
vector without an insert. (b), Coomassie-stained 15% SDS-polyacrylamide gel of purified rCor a 1/16, rCor a
1/16T10 and rBet v 1a. (c) and (d), Immunoblots of purified rCor a 1/16, rCor a 1/16T10, and rBet v 1a probed
with BIP 1, a monoclonal anti-Bet v 1, and with a polyclonal anti-rBet v 1a, respectively.

3. RESULTS AND DISCUSSION

3.1. Expression, Purification, and Mass Spectrometry Analysis of Cor a
1/16 Proteins

In this study, we have used PCR-mediated mutagenesis to generate a mutant of the
Cor a 1/16 allergen. A mutant primer was used that had the proline codon at amino acid
position 10 replaced by a threonine codon. The strategy for the construction of this Cor a
1/16 mutant (Cor a 1/16T10) is outlined in Figure 1.

The cDNAs coding for Bet v 1a, Cor a 1/16 and Cor a 1/16T10 were subcloned in
the pMW175 expression vector and high levels of recombinant non-fusion proteins were
produced by induction with IPTG. Figure 2A shows a Coomassie-stained gel of expressed
Bet v 1a, Cor a 1/16, and Cor a 1/16T10 proteins.

The recombinant proteins were purified from crude bacterial lysates using chromatofocusing
and reversed-phase HPLC. The proteins appeared homogeneous as determined
by SDS-PAGE and Coomassie staining (Figure 2B).

In order to confirm at the protein level the sequence of the Cor a 1/16T10 mutant allergen,
purified rCor a 1/16 and rCor a 1/16T10 were digested with trypsin and the proteolytic
fragments subjected to MALDI-MS. Nine peptides were detected for both Cor a 1/16 and Cor
a 1/16T10, and their molecular weights were determined from the obtained spectra (Table 1).
The observed mass signals could be easily matched with the molecular weights of peptides
predicted from the amino acid sequence deduced from the published Cor a 1/16 sequence (T1,
T4-T6, T9-T10, T12, T18-T19). These peptides covered about 70% of the Cor a 1/16 sequence.
All peptides detected by MALDI-MS of rCor a 1/16 digests were also detected in rCor a



Table 1. Mass determination of tryptic fragments T1-T19 of rCor a 1/16T10. Theoretical
m/z values give the calculated masses of the peptides plus one proton [M+H]+

Fragment     Sequence                  theor.     observed
T1           GVFNYEVETPSVISAAR         1840.03    1839.6
T1 P10->T    GVFNYEVETTSVISAAR         1844.02    1843.2
T2           LFK                        407.53
T3           SYVLDGDK                   896.96
T4           LIPK                       470.62     470.3
T5           VAPQAITSVENVGGNGGPGTIK    2067.29    2067.3
T6           NITFGEGSR                  981.04     981.2
T7           YK                         310.36
T8           YVK                        409.49
T9           ERVDEVDNTNFK              1466.64    1466.6
T10          YSYTVIEGDVLGDKLEK         1929.96    1929.4
T11          VCSELK                     678.81
T12          IVAAPGGGSTLK              1071.25    1071.2
T13          ISSK                       434.50
T14          FHAK                       502.58
T15          GDHEINAEEMK               1273.35
T16          GAK                        275.32
T17          EMAEK                      607.69
T18          LLR                        401.52     401.5
T19          AVETYLLAHSAEYN            1581.71    1581.7

   MGVFNYEVETTSVISAARLFKSYVLDGDKLIPKVAPQAITSVENVGGNGGPGTIKNITFGEGSRYKYVKERVDEVDNTNF
80 KYSYTVIEGDVLGDKLEKVCSELKIVAAPGGGSTLKISSKFHAKGDHEINAEEMKGAKEMAEKLLRAVETYLLAHSAEYN 159

Figure 3. Tryptic peptides of rCor a 1/16T10 (T1-T19) identified by MALDI-MS. The positions of peptides
stimulating specific T cell clones (A-E) are indicated by boxes above the sequence.



1/16T10 digests, except for the signal at m/z 1839 corresponding to the N-terminal peptide T1
(Table 1). According to the Cor a 1/16T10 cDNA sequence, peptide T1 should contain the single
amino acid exchange (Pro -> Thr) when compared to Cor a 1/16. In tryptic digests of rCor
a 1/16T10 a signal at m/z 1843 (T1 P10->T) corresponded exactly to the expected mass of T1
with an exchange of proline for a threonine (Table 1). Figure 3 shows the recorded mass signals
of rCor a 1/16T10 proteolytic digests mapped onto the cDNA-derived Cor a 1/16T10 sequence
according to their molecular mass and enzyme specificity. Thus, the single amino acid
substitution engineered in the Cor a 1/16 cDNA to produce Cor a 1/16T10 was also confirmed
at the protein level by mass spectrometry analysis of proteolytic digests of rCor a 1/16T10
mutant protein.
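
The expected mass shift can be checked by hand. The sketch below assumes average (not monoisotopic) residue masses, which reproduce the theoretical values of Table 1 to within a few hundredths of a mass unit; the constant and function names are illustrative and only the residues occurring in peptide T1 are listed.

```python
# Average residue masses in Da (standard reference values).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "I": 113.1594, "N": 114.1038,
    "E": 129.1155, "F": 147.1766, "R": 156.1875, "Y": 163.1760,
}
WATER = 18.0153   # one water is added for the free N- and C-termini
PROTON = 1.0073   # charge carrier for the singly protonated ion


def mz_singly_protonated(peptide):
    """Return the average-mass m/z of the [M+H]+ ion of a linear peptide."""
    residues = sum(AVERAGE_RESIDUE_MASS[aa] for aa in peptide)
    return residues + WATER + PROTON


T1_WILD_TYPE = "GVFNYEVETPSVISAAR"   # Cor a 1/16
T1_MUTANT = "GVFNYEVETTSVISAAR"      # Cor a 1/16T10 (Pro-10 replaced by Thr)

wild_type_mz = mz_singly_protonated(T1_WILD_TYPE)
mutant_mz = mz_singly_protonated(T1_MUTANT)
print(round(wild_type_mz, 2), round(mutant_mz, 2), round(mutant_mz - wild_type_mz, 2))
```

The computed shift of about 4 mass units is the difference between the threonine and proline residue masses, which is why the T1 signal moves from m/z 1839 to m/z 1843 in the mutant digest.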

3.2 Immunological Properties of Cor a 1/16T10

To evaluate the antibody-binding properties of purified rCor a 1/16T10 mutant protein
in comparison to rCor a 1/16 and rBet v 1a, we performed immunoblotting experiments.

Figures 2C and 2D show immunoblots of the purified proteins using a monoclonal
anti-Bet v 1 antibody, BIP 1 (Fig. 2C), and a rabbit anti-rBet v 1a serum (Fig. 2D). Both
antibodies showed strong reactivity to rBet v 1a. In contrast, BIP 1 did not react with rCor
a 1/16 or with rCor a 1/16T10, and the polyclonal anti-Bet v 1a showed a weak reactivity
to both rCor a 1/16 and rCor a 1/16T10 proteins.

Immunoblot experiments using sera from birch pollen allergic patients showed remarkable
differences in the IgE-binding properties of rCor a 1/16 and rCor a 1/16T10.
Figure 4 shows the IgE-binding patterns of rBet v 1a, rCor a 1/16, and rCor a 1/16T10 using
sera from five birch pollen allergic patients. All patients showed a marked increase in
IgE-binding to rCor a 1/16T10 mutant protein compared to wild type rCor a 1/16, except
patient 2. For patients 1, 4, and 5, the replacement of proline at position 10 by a threonine
resulted in a change of the antibody-binding pattern from "no-binding" to strong IgE-binding.
Interestingly, rCor a 1/16T10 mutant protein in some cases displayed higher IgE-binding
activity than rBet v 1a (patients 1, 2, 3, and 5).

Presently, there are no data available on the 3D-structure of Bet v 1 or homologous
proteins. Also, there is no precise information available about IgE-binding motif(s) on
the Bet v 1 or Bet v 1-related allergens. However, there are indications that IgE-binding
structures on the Bet v 1 molecule might be determined by the protein conformation (10).

Among all standard amino acids, proline seems to occupy a unique position. Proline
imposes strong conformational constraints on the peptide chain because the side-chain is
cyclized back onto the backbone amide position. When present inside an alpha-helix, the
possibility of making hydrogen bonds to the preceding turn is hindered and a kink of 20°
or more will be introduced in the alpha-helix. In addition, proline can introduce structural
heterogeneity since the X-proline (X being any amino acid) bond can assume either the
stereoisomeric cis or trans conformation (for a review see 12).

Figure 4. IgE-binding to rCor a 1/16T10, rCor a 1/16, and rBet v 1a. Immunoblot experiments showing serum
IgE-reactivity of 5 birch pollen allergic patients.

Taking into consideration the exceptional properties of proline, it is possible that the
replacement of proline-10 in Cor a 1/16 by a threonine residue causes conformational
changes that result in a dramatic increase in its IgE-binding activity. This correlates
well with the fact that Bet v 1a, which displays high IgE-binding activity, has a
threonine at position 10. It will be interesting to test whether replacing threonine-10
in Bet v 1a by a proline will lower its IgE-binding activity.

3.3 Activation of Allergen-Specific T Cell Clones 

The ability of rCor a 1/16T10 to activate allergen-specific T cell clones was evaluated
using Bet v 1a-specific T cell clones. We tested fourteen clones that were established
from the peripheral blood of birch pollen allergic patients and were shown to recognize
distinct epitopes (A-E) scattered over the whole Bet v 1a molecule (Fig. 5) (ref. 11).
Seven of these Bet v 1a-specific T cell clones also recognize the corresponding sequence
on Cor a 1/16. The epitope recognized by the clone RR9 comprises the amino acid substitution
on rCor a 1/16T10. This clone reacted with both rCor a 1/16 and rCor a 1/16T10
mutant protein. Except for the clone WD25, all other clones reacting with rCor a 1/16 also
reacted with rCor a 1/16T10. It is not clear why WD25 failed to proliferate in response
to rCor a 1/16T10. As shown in Fig. 5, this clone recognizes an epitope (epitope E) corresponding
to the C-terminal region (142-156) of Bet v 1a and Cor a 1/16. The possibility
of a mutation in the peptide epitope recognized by this clone can be ruled out since
MALDI-MS analysis of proteolytic fragments of rCor a 1/16T10 confirmed the structural
integrity of the region corresponding to epitope E (see Fig. 3 and Table 1). It is conceivable
that the single amino acid exchange at position 10 in Cor a 1/16T10 can affect the
conformation of the protein to an extent that the processing by antigen presenting cells is
different and a non-reactive peptide is created or that the epitope is destroyed. In this respect,
Finnegan and Amburgey (13) showed that a single amino acid change in the staphylococcal
nuclease protein affects the structure of the processed peptides in such a manner
that stimulatory determinants are no longer presented to certain T cell clones.

Figure 5. Proliferative responses of human Bet v 1-specific T cell clones (TCC) to rCor a 1/16, rCor a 1/16T10,
and rBet v 1a. On top, the black bar represents the Bet v 1a amino acid sequence. The positions of the epitopes (A-E)
recognized by the TCC are indicated by boxes above the Bet v 1a sequence.

4. CONCLUSIONS 

The results presented here suggest that it is possible to modulate the IgE-binding
properties of tree pollen allergens by single amino acid substitutions at crucial positions.
This finding makes it possible to develop second-generation recombinant allergens with
antibody-binding properties specifically modulated for diagnosis and for therapy.

Following this line, we are presently testing the effect of single amino acid exchanges
on the IgE-binding properties of Bet v 1a. The exchanges were chosen on the basis of the
patterns of amino acid substitutions in Bet v 1 isoforms displaying low IgE-binding properties.

The identification of positions crucial for IgE binding might be facilitated by cloning
and sequencing isoforms of a particular allergen and determining their IgE-binding
properties. In addition, isoforms with low or no IgE-binding activity could be useful tools
for defining IgE-binding structures on allergens.

A major peanut allergen, Ara h 2, is recognized by
serum IgE from >90% of patients with peanut hypersensitivity.
Biochemical characterization of this allergen
indicates that it is a glycoprotein of ~17.5 kDa.
Using N-terminal amino acid sequence data from purified
Ara h 2, oligonucleotide primers were synthesized
and used to identify a clone (741 bp) from a peanut
cDNA library. This clone was capable of encoding a
17.5-kDa protein with homology to the conglutin family
of seed storage proteins. The major linear immunoglobulin
E (IgE)-binding epitopes of this allergen were
mapped using overlapping peptides synthesized on an
activated cellulose membrane and pooled serum IgE
from 15 peanut-sensitive patients. Ten IgE-binding
epitopes were identified, distributed throughout the
length of the Ara h 2 protein. Sixty-three percent of
the amino acids represented in the epitopes were either
polar uncharged or apolar residues. In an effort
to determine which, if any, of the 10 epitopes were
recognized by the majority of patients with peanut hypersensitivity,
each set of 10 peptides was probed individually
with serum IgE from 10 different patients. All
of the patient sera tested recognized multiple epitopes.
Three epitopes (aa27-36, aa57-66, and aa65-74) were
recognized by all patients tested. In addition, these
three peptides bound more IgE than all the other epitopes
combined, indicating that they are the immunodominant
epitopes of the Ara h 2 protein. Mutational
analysis of the Ara h 2 epitopes indicates that single
amino acid changes result in loss of IgE binding. Two
epitopes in region aa57-74 contained the amino acid
sequence DPYSP that appears to be necessary for IgE
binding. These results may allow for the design of improved
diagnostic and therapeutic approaches to peanut
hypersensitivity. © 1997 Academic Press

4301 W. Markham, Little Rock, AR 72205. Fax: (501) 686-8169.



Immediate hypersensitivity reactions to foods occur
in 6-8% of children and about 1% of adults (1, 2) and
are mediated by the production of immunoglobulin E
(IgE) antibodies to glycoproteins present in the food
(3). Peanuts are a major cause of serious allergic reactions
in both children and adults. The hypersensitivity
to peanuts often starts in childhood and continues
throughout life. This is in contrast to other childhood
food allergies such as to milk and eggs which generally
resolve spontaneously with age (4). In addition, peanut
allergy is more likely to cause fatal anaphylaxis than
any other food allergy (5-7). Currently, avoidance is
the only effective means of dealing with food allergy,
but the use of peanuts and peanut by-products as supplements
in many different foods makes accidental consumption
almost inevitable. Thus, the prevalence and
chronic nature of peanut allergy, the potential severity
of the allergic reaction, and the widespread use of peanuts
in consumer foods necessitate improved methods
for managing peanut hypersensitivity.

Food hypersensitivity reactions occur shortly after
contact of a specific allergen with its corresponding IgE
antibodies which are bound to mast cells. Cross-linking
of the allergen-specific IgE by the respective allergen
activates the mast cells to release histamine, heparin,
and other mediators responsible for the clinical symptoms
observed. Thus, the IgE-binding epitopes of the
allergens play an important role in the disease process.
If improved diagnostic and therapeutic capabilities are
to be developed, it is important to determine the primary
structure of the major allergens, the IgE-binding
sites of these allergens, and the frequency of recognition
of any IgE-binding epitopes. Characterization of
allergens will provide a better understanding of the
human immune response involved in food hypersensitivity
reactions.

Abbreviations used: IgE, immunoglobulin E; DMF, N,N-dimethylformamide;
TBS, Tris-buffered saline.

0003-9861/97 $25.00
Copyright © 1997 by Academic Press

Several studies have shown that the most allergenic
portion of the peanut is the protein fraction of the cotyledon
(8-10). Two highly abundant glycoproteins found
in the cotyledon are the peanut allergens, Ara h 1 and
Ara h 2. These proteins are recognized by serum IgE
from >90% of peanut-sensitive patients, thus establishing
them as important allergens (11, 12). The majority
of serum IgE recognition of the Ara h 1 and Ara
h 2 allergens appears to be due to epitopes within these
proteins that are linear amino acid sequences that do
not contain significant amounts of carbohydrate (13).
We have recently described the cloning of the gene encoding
the Ara h 1 allergen and identified it as a seed
storage protein belonging to the vicilin family of legume
storage proteins (13).

In this paper we report the cloning and nucleotide
sequence of the second major peanut allergen, Ara h
2. The derived amino acid sequence has been used to
construct synthetic peptides and perform a detailed examination
of the linear IgE binding epitopes of this
protein. These results may allow for the design of improved
diagnostic and therapeutic approaches to peanut
hypersensitivity.

MATERIALS AND METHODS 

Serum IgE. Serum from 15 patients with documented peanut
hypersensitivity (mean age, 25 years) was used to identify peanut
allergens. Each of these individuals had a positive immediate skin
prick test to peanut and either a positive double-blind, placebo-controlled,
food challenge or a convincing history of peanut anaphylaxis
(laryngeal edema, severe wheezing, and/or hypotension). Details of
the challenge procedure and interpretation have been discussed previously
(11). Representative individuals with elevated serum IgE
levels (who did not have peanut specific IgE or peanut hypersensitivity)
were used as controls in these studies. At least 5 ml of venous
blood was drawn from each patient and allowed to clot, and the
serum was collected. All studies were approved by the Human Use
Advisory Committee at the University of Arkansas for Medical Sciences.

Isolation and amino acid sequence analysis of peanut allergen Ara
h 2. Ara h 2 was purified to near homogeneity from whole peanut
extracts according to the methods of Burks et al. (12). Purified Ara
h 2 was electrophoresed on 12.5% acrylamide mini-gels (Bio-Rad
Laboratories, Hercules, CA) in Tris/SDS/glycine buffer. The gels were
stained with 0.1% Coomassie blue in 10% acetic acid and 50% methanol
and destained in 40% methanol for 3 h with continuous shaking.
Gel slices containing Ara h 2 were sent to the W. M. Keck Foundation
(Biotechnology Resource Laboratory, Yale University, New Haven,
CT) for amino acid sequencing. Amino acid sequencing of intact Ara
h 2 and tryptic peptides of this protein was performed on an Applied
Biosystems sequencer with an on-line HPLC column that was eluted
with increasing concentrations of acetonitrile.



Peanut RNA isolation and Northern (RNA) gels. Three commercial
lots from the 1979 crop of medium grade peanut species, Arachis
hypogaea (Florunner), were obtained from North Carolina State
University for this study. Total RNA was isolated from 1 g of this
material according to procedures described by Larsen (14). Poly(A)+
RNA was isolated using a purification kit (Collaborative Research,
Bedford, MA) according to the manufacturer's instructions. Poly(A)+
RNA was subjected to electrophoresis in 1.2% formaldehyde agarose
gels, transferred to nitrocellulose, and hybridized with 32P-labeled
probes according to the methods of Bannon et al. (15).

Computer analysis of Ara h 2 sequence. Sequence analysis of the
Ara h 2 gene was done on the University of Arkansas for Medical
Sciences' Vax computer using the Wisconsin DNA analysis software
package. The algorithm of Needleman and Wunsch (16) was used to align
the complete amino acid sequence of Ara h 2 with homologous
proteins before determining the percent identity.
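As an illustration of the alignment step described above, the following is a minimal sketch of Needleman-Wunsch global alignment followed by a percent-identity calculation. It is not the Wisconsin package implementation: the scoring values (match +1, mismatch -1, linear gap -1) and the choice to divide identical positions by the full alignment length are assumptions made for this example only, and the function names are hypothetical.

```python
def _pair_score(x, y, match, mismatch):
    """Score for aligning residue x against residue y."""
    return match if x == y else mismatch


def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align a and b; return the two gapped strings."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + _pair_score(a[i - 1], b[j - 1], match, mismatch)
            score[i][j] = max(diag, score[i - 1][j] + gap, score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and score[i][j] ==
                score[i - 1][j - 1] + _pair_score(a[i - 1], b[j - 1], match, mismatch)):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Identical, non-gap columns as a percentage of alignment length."""
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

Protein comparisons in practice normally use a substitution matrix and affine gap penalties rather than the flat scores assumed here, so the percentages from this sketch would not be expected to reproduce those in Table II.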

cDNA expression library construction and screening. Peanut
poly(A)+ RNA was used to synthesize double-stranded cDNA according
to the methods of Watson and Jackson (17) and Huynh et al.
(18). The cDNA was treated with EcoRI methylase and then ligated
with EcoRI and XhoI linkers. The DNA was then ligated with
EcoRI-XhoI cut, phosphatase-treated lambda-ZAP XR phage arms
(Stratagene, La Jolla, CA), and in vitro packaged. The library was
95% recombinants carrying insert sizes >400 bp. The library was
screened using an IgE antibody pool consisting of an equal volume of
serum from each patient with peanut hypersensitivity. Detection of
primary antibody was with 125I-labeled anti-IgE antibody performed
according to the manufacturer's instructions (Sanofi, Chaska, MN).

PCR amplification of the Ara h 2 mRNA sequence. Using the
oligonucleotide CA(AG)CA(AG)TGGGA(AG)TT(AG)CA(AG)GG(N)GA(TC)AG
derived from amino acid sequence analysis of the Ara h 2
peanut allergen as one primer and a 23-nt primer which hybridizes
to the Bluescript vector, the cDNA that encodes Ara h 2 was amplified
from the IgE-positive clones. Reactions were carried out in a buffer
containing 3 mM MgCl2, 500 mM KCl, and 100 mM Tris-HCl, pH 9.0.
Each cycle of the polymerase chain reaction consisted of 30 s at 95°C,
followed by 1 min at 56°C, and 2 min at 72°C. Thirty cycles were
performed with both primers present in all cycles. From this reaction,
a clone carrying an approximately 700-bp insert was identified.
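The degenerate primer above uses parentheses to mark positions where more than one base is allowed. A short sketch, assuming that notation and treating (N) as any of the four bases, shows how the primer length and the number of distinct sequences in the mixture can be counted. The helper names are hypothetical and not part of the original methods.

```python
import re

# (N) is expanded to all four bases; other groups list their bases directly.
AMBIGUITY = {"N": "ACGT"}


def parse_degenerate(oligo):
    """Return one string of allowed bases per primer position."""
    positions = []
    for group, single in re.findall(r"\(([A-Z]+)\)|([A-Z])", oligo):
        choices = group or single
        positions.append(AMBIGUITY.get(choices, choices))
    return positions


def degeneracy(oligo):
    """Number of distinct sequences represented by the degenerate oligo."""
    total = 1
    for choices in parse_degenerate(oligo):
        total *= len(choices)
    return total


PRIMER = "CA(AG)CA(AG)TGGGA(AG)TT(AG)CA(AG)GG(N)GA(TC)AG"

if __name__ == "__main__":
    print(len(parse_degenerate(PRIMER)), "nt,", degeneracy(PRIMER), "sequences")
```

Five two-base positions, one four-base position, and one further two-base position give 2^5 x 4 x 2 = 256 sequences over 26 nucleotides, which corresponds to the codons for Q-Q-W-E-L-Q-G-D of peptide I plus two further bases.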

DNA sequencing and analysis. DNA sequencing was done according
to the methods of Sanger et al. (19) using either 32P-end-labeled
oligonucleotide primers or an automated ABI model 377 DNA
sequencer using fluorescent tagged nucleotides. Most areas of the
clone were sequenced at least twice and in some cases in both
directions to ensure an accurate nucleotide sequence for the Ara h 2
gene.

Peptide synthesis. Individual peptides were synthesized on a
derivatised cellulose membrane using Fmoc amino acid active esters
according to the manufacturer's instructions (Genosys
Biotechnologies, Woodlands, TX). Fmoc-amino acid derivatives were
dissolved in 1-methyl-2-pyrrolidone and loaded on marked spots on the
membrane. Coupling reactions were followed by acetylation with a
solution of 4% (v/v) acetic anhydride in N,N-dimethylformamide (DMF).
After acetylation, Fmoc groups were removed by incubation of the
membrane in 20% (v/v) piperidine in DMF. The membrane was then
stained with bromophenol blue to identify the location of the free
amino groups. Cycles of coupling, blocking, and deprotection were
repeated until the peptides of the desired length were synthesized.
After addition of the last amino acid in the peptide, the amino acid
side chains were deprotected using a solution of dichloromethane/
trifluoroacetic acid/triisobutylsilane (1/1/0.05). Membranes were
either probed immediately or stored at -20°C until needed.

IgE binding assay. Cellulose membranes containing synthesized
peptides were washed with Tris-buffered saline (TBS) and then
incubated with blocking solution overnight at room temperature. After
blocking, the membranes were incubated with serum from patients
with peanut hypersensitivity diluted (1:5) in a solution containing
TBS and 1% bovine serum albumin for at least 12 h at 4°C or 2 h at
TABLE I

Amino Acid Sequence of Ara h 2 Peptides

Peptide    Amino acid sequence
I          X-Q-Q-W-E-L-Q-G-D-R-R-R-Q-S-Q-L-E-R
II         A-N-L-R-P-C-E-Q-H-L-M-Q-K

Note. The amino acid sequence of the amino terminus (I) and a
tryptic peptide (II) derived from Ara h 2 protein was determined.
The sequence is shown as the one-letter amino acid code. X indicates
an amino acid that was unable to be assigned.



room temperature. Primary antibody was detected with 125I-labeled
anti-IgE antibody (Sanofi).

RESULTS 

Isolation and partial amino acid sequence determination
of the Ara h 2 protein. The amino terminus of the
purified Ara h 2 protein, or peptides resulting from
trypsin digestion of this protein, were used for amino
acid sequence determination. The amino acid sequences
representing the amino terminus of the Ara h 2 protein
(peptide I) and a tryptic peptide fragment (peptide II)
are noted in Table I. It was possible to determine the
first 17 residues from peptide I and the first 13 residues
from peptide II of the major peptide in each fraction.
These results confirm and extend previous amino acid
sequence analysis of the Ara h 2 protein (12).

Identification and characterization of clones that encode
peanut allergen Ara h 2. RNA isolated from the
Florunner variety of peanuts (A. hypogaea) was used
to construct an expression library for screening with
serum IgE from patients with peanut hypersensitivity.
Numerous IgE-binding clones were isolated from this
library after screening 10^ clones with serum IgE from
a pool of patients with reactivity to most peanut allergens
by Western blot analysis. Since the number of
plaques reacting with serum IgE was too large to study
all in detail, we randomly selected 63 positive clones
for further analysis. The inserts from each of these
clones were then amplified using vector-specific primers
and PCR, separated by agarose gel electrophoresis,
and blotted onto nitrocellulose. To identify which of the
clones encoded the Ara h 2 allergen, a hybridization
probe was constructed using a radioactive oligonucleotide
[CA(AG)CA(AG)TGGGA(AG)TT(AG)CA(AG)GG(N)GA(TC)AG]
developed from the amino acid sequence determined
for peptide I and used to probe the amplified
inserts. Utilizing this approach, two plaques with
~700-bp inserts were identified. DNA sequencing revealed
that the selected clones carried identical 741-base
inserts which included a 21-base poly(A) tail and
a 240-base 3' noncoding region. This insert contained
a large open reading frame starting with a CTG codon
FIG. 1. Nucleotide sequence of an Ara h 2 cDNA clone. The nucleotide sequence is shown on the first line. The second line is the derived
amino acid sequence. Amino acid residues in bold correspond to the determined amino acid sequences of peptides I and II of Ara h 2 (Table
I). The numbers on the right of the figure indicate the position of the nucleotide sequence relative to the first nucleotide in the insert. The
numbers on the left of the figure indicate the position of the amino acid sequence relative to the first amino acid encoded by this clone.
TABLE II

Ara h 2 Sequence Similarities

Protein                 Source         % Similar
Conglutin-δ             Lupin          39
Mabinlin I (chain B)    Caper          32-35
2S albumin              Sunflower      34
2S albumin              Castor bean    30
α-Amylase inhibitor     Wheat          29
CM3 protein             Wheat          27

Note. The Ara h 2 nucleotide and derived amino acid sequences
were used to search the GenBank, Swiss-Prot, and EMBL databases
for any homologous proteins. The table lists the proteins that had
the highest similarity to the Ara h 2 sequence, the plant source of
those proteins, and the percentage similarity between that protein
and Ara h 2.



and ending with a TAA stop codon at nucleotide position
474 (Fig. 1). The calculated size of the protein
encoded by this open reading frame was ~17.5 kDa,
which is in good agreement with the molecular weight
of Ara h 2 that has been determined experimentally
(12). With the exception of a single cysteine residue at
position 30, the amino acid sequence that was determined
from the purified Ara h 2 protein (Table I) was
found in this clone. The additional coding region on the
amino terminal end of this clone probably represents
a signal peptide which would be cleaved from the mature
Ara h 2 allergen.

To determine what size mRNA this clone identified,
a 32P-labeled insert was used as a hybridization probe
of a Northern blot containing peanut poly(A)+ RNA
(data not shown). This insert hybridized to an ~0.7-kb
mRNA. The size of the cloned insert and the size of the
mRNA were in good agreement. In addition, there was
good agreement between the calculated and determined
size of the Ara h 2 protein. Furthermore, the
identity of the determined amino acid sequence from
the Ara h 2 peptides agreed with that which was determined
from the clone. From these data we concluded
that an Ara h 2-specific clone had been isolated.

Peanut allergen Ara h 2 is a conglutin-like seed storage
protein. A search of the GenBank, Swiss-Prot,
and EMBL databases revealed significant amino acid
sequence homology between the Ara h 2 protein and
FIG. 2. Multiple IgE-binding sites identified in the Ara h 2 allergen. Epitope analysis was performed on the Ara h 2 allergen by synthesizing
15-amino-acid-long peptides, offset from each other by 8 amino acids, for the entire protein molecule. These peptides, represented as spots
1-19, were then probed with a serum pool consisting of 15 patients with documented peanut hypersensitivity. The Ara h 2 amino acid
sequence is shown as the one-letter amino acid code in the bottom half of this figure. The shaded boxes (D1-D3) correspond to the determined
IgE-binding regions.
FIG. 3. Core IgE-binding epitopes identified on the Ara h 2 allergen.
Epitope analysis was performed on the IgE-binding sites identified
in Fig. 2 by synthesizing 10-amino-acid-long peptides offset by
two amino acids. These peptides were then probed with the 15-patient
serum pool. (A) The peptide analysis of Ara h 2 amino acid
residues 55-76. This region contains peptides 6 and 7 identified in
Table III. (B) The amino acid sequence of this region.

seed storage proteins from a variety of different plants
(Table II). The highest percent identity (40%) was observed
between the Ara h 2 protein and conglutin-δ, a
sulfur-rich protein from the lupin seed (20). 2S albumins
and mabinlins also had a high degree of homology
(30-35%) with the Ara h 2 protein sequence (21).
Interestingly, the Ara h 2 protein had some similarity
(26-29%) with α-amylase inhibitors from wheat (22, 23),
which are the major allergens in baker's asthma (24,
25) and are important allergens in patients experiencing
hypersensitivity reactions following the ingestion
of wheat protein (26).

Multiple IgE-binding epitopes on the Ara h 2 protein.
Nineteen overlapping peptides representing the derived
amino acid sequence of the Ara h 2 protein were
synthesized to determine which regions were recognized
by serum IgE. Each peptide was 15 amino acids
long and was offset from the previous peptide by 8
amino acids. In this manner, the entire length of the
Ara h 2 protein could be studied in large overlapping
fragments. These peptides were then probed with a
pool of serum from 15 patients with documented peanut
hypersensitivity or serum from a representative
control patient with no peanut hypersensitivity. Serum
IgE from the control patient did not recognize any of
the synthesized peptides (data not shown). In contrast,
Fig. 2 shows that there are three IgE-binding regions
along the entire length of the Ara h 2 protein that are
recognized by this population of patients with peanut
hypersensitivity. These IgE-binding regions represent
amino acid residues 17-39, 41-80, and 114-157.
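The overlapping-peptide design described here (a fixed window stepped along the protein by a fixed offset) can be sketched as follows. This is an illustration only: the function name is hypothetical, positions are reported as 1-based inclusive ranges, and allowing the final peptide to be shorter than the window is an assumption, since the text does not say how the C-terminal end was handled.

```python
def tile_peptides(sequence, length, offset):
    """Return (start, end, peptide) tuples covering the whole sequence.

    start and end are 1-based, inclusive residue positions. The last
    peptide may be shorter than `length` (an assumption of this sketch).
    """
    peptides = []
    start = 0
    while True:
        end = min(start + length, len(sequence))
        peptides.append((start + 1, end, sequence[start:end]))
        if end == len(sequence):
            break
        start += offset
    return peptides


if __name__ == "__main__":
    # Placeholder sequence, not the Ara h 2 sequence.
    demo = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    for first, last, pep in tile_peptides(demo, length=15, offset=8):
        print(first, last, pep)
```

With a 15-residue window and an offset of 8, a protein of roughly 152 to 159 residues yields 19 peptides under this scheme, in line with the 19 spots reported. The same function with length 10 and offset 2 describes the finer mapping used to resolve individual epitopes.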

In order to determine the exact amino acid sequence
of the IgE-binding regions, small peptides (10 amino
acids long offset by two amino acids) representing the
larger IgE-binding regions were synthesized. In this
manner it was possible to identify individual IgE-binding
epitopes within the larger IgE-binding regions of
the Ara h 2 molecule (Fig. 3). The 10 IgE-binding epitopes
that were identified in this manner are shown in
Table III. The size of the epitopes ranged from 6 to 10
amino acids in length. Three epitopes (aa15-24, aa21-30,
and aa27-36), which partially overlapped with each
other, were found in the region of amino acid residues
17-39. Four epitopes (aa39-48, aa49-58, aa57-66, and
aa65-74) were found in region 41-80. Finally, three
epitopes (aa115-124, aa127-136, and aa143-152) were
found in the region represented by amino acid residues
114-157. Sixty-three percent of the amino acids represented
in the epitopes were either polar uncharged or
apolar residues. There was no obvious amino acid sequence
motif that was shared by all the epitopes, with
the exception of epitopes 6 and 7, which contained the
sequence DPYSP.
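The shared stretch between epitopes 6 and 7 can be checked directly from the two 10-residue sequences listed in Table III. The sketch below uses a standard dynamic-programming longest-common-substring routine; the function name is hypothetical and the routine is offered as an illustration, not as the method used in the study.

```python
def longest_common_substring(a, b):
    """Return the longest contiguous run of residues present in both a and b."""
    best = ""
    # prev[j] is the length of the common suffix of a[:i-1] and b[:j].
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1
                if curr[j] > len(best):
                    best = a[i - curr[j]:i]
        prev = curr
    return best


EPITOPE_6 = "YERDPYSPSQ"  # Ara h 2 residues 57-66 (Table III)
EPITOPE_7 = "SQDPYSPSPY"  # Ara h 2 residues 65-74 (Table III)

if __name__ == "__main__":
    print(longest_common_substring(EPITOPE_6, EPITOPE_7))
```

Applied to the two peptides, the longest shared run is the six-residue stretch DPYSPS, which contains the DPYSP sequence noted above.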

Identification of the immunodominant Ara h 2 epitopes.
In an effort to determine which, if any, of the 10
epitopes was immunodominant, each set of 10 peptides
was probed individually with serum IgE from 10 different
patients. Five patients were randomly selected
from the pool of 15 patients used to identify the common
epitopes, and 5 patients were selected from outside
this pool. Figure 4A shows an immunoblot strip
containing these peptides that has been incubated with
an individual patient's serum. This patient's serum IgE
recognized peptides 1, 3, 4, 6, and 7. The remaining
patients' serum IgE was tested in the same manner
and the intensity of IgE binding to each spot was determined
as a function of that patient's total IgE binding
to these 10 epitopes (Fig. 4B). All of the patient sera
tested (10/10) recognized multiple peptides. Epitopes
3, 6, and 7 were recognized by serum IgE of all patients
tested (10/10). In addition, serum IgE that recognizes
these peptides represents the majority of Ara h 2-specific
IgE found in these patients. These results indicate



TABLE III

Ara h 2 IgE Binding Epitopes

Peptide    AA sequence    Ara h 2 position
1          HASARQQWEL     15-24
2          QWELQGDRRC     21-30
3          DRRCQSQLER     27-36
4          LRPCEQHLMQ     39-48
5          KIQRDEDSYE     49-58
6          YERDPYSPSQ     57-66
7          SQDPYSPSPY     65-74
8          DRLQGRQQEQ     115-124
9          KRELRNLPQQ     127-136
10         QRCDLDVESG     143-152

Note. The Ara h 2 IgE-binding epitopes (1-10) are indicated as
the bold, underlined, single-letter amino acid code. The position of
each peptide with respect to the Ara h 2 protein is indicated in the
right-hand column.
FIG. 4. Identification of the immunodominant Ara h 2 epitopes. IgE-binding peptides (1-10) were synthesized as 10-amino-acid peptides
and probed with serum IgE from 10 patients with documented peanut hypersensitivity. IgE binding to individual peptides was detected
by 125I-labeled anti-human IgE. (A) A representative SPOTS membrane containing Ara h 2 epitopes 1-10 and probed with serum IgE from
a single patient. (B) The relative intensity of IgE binding to each peptide by individual patients was then determined by densitometry and
expressed as a percentage of that patient's total IgE binding to all of the Ara h 2 peptides. Five patients were selected at random from the
15-patient serum pool and 5 were patients with peanut hypersensitivity that were not included in the pool.
FIG. 5. All of the Ara h 2 peptides can be mutated to non-IgE-binding epitopes by single amino acid changes. All of the Ara h 2 epitopes
(peptides 1-10) were synthesized with an alanine residue substituted for one of the amino acids and then probed with a pool of serum IgE
from 15 patients with documented peanut hypersensitivity. IgE binding to individual peptides was detected by 125I-labeled anti-human IgE.
that peptides 3, 6, and 7 contain the immunodominant
epitopes of the Ara h 2 protein.
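The per-patient normalization used for Fig. 4B (binding to each peptide expressed as a percentage of that patient's total binding to all ten peptides) amounts to the following calculation. The densitometry values in the example are invented placeholders, not data from the study, and the function name is hypothetical.

```python
def relative_binding(intensities):
    """Convert raw spot intensities to percentages of the patient's total.

    intensities maps peptide number -> densitometry reading for one patient.
    """
    total = sum(intensities.values())
    if total == 0:
        # No detectable binding to any peptide: report zero throughout.
        return {peptide: 0.0 for peptide in intensities}
    return {peptide: 100.0 * value / total for peptide, value in intensities.items()}


if __name__ == "__main__":
    # Hypothetical readings for a single patient (arbitrary units).
    patient = {1: 10, 3: 30, 4: 0, 6: 40, 7: 20}
    for peptide, percent in sorted(relative_binding(patient).items()):
        print(f"peptide {peptide}: {percent:.1f}%")
```

Normalizing within each patient removes differences in overall IgE titer, so the profiles of different patients can be compared on the same scale.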

The Ara h 2 peptides can be mutated to non-IgE-binding
epitopes by single amino acid changes. To assess
the importance of individual amino acids in each
of the Ara h 2 epitopes, they were synthesized as 10-amino-acid
peptides with alanine residues being
substituted one at a time for each of the amino
acids in the epitope. These peptides were then probed
with pooled serum IgE from 15 patients with documented
peanut hypersensitivity. Figure 5A shows an
immunoblot strip containing the wild-type and mutated
peptides of epitope 7. The pooled serum IgE did
not recognize this peptide or binding was drastically
reduced when alanine was substituted for amino acids
at position 67, 68, or 69. In contrast, the substitution
of an alanine for a serine residue at position 70 resulted
in increased IgE binding. The remaining Ara h 2 epitopes
were tested in the same manner and the intensity
of IgE binding to each spot was determined as a percentage
of IgE binding to the wild-type peptide (Fig.
5B). In general, each epitope could be mutated to a
non-IgE-binding peptide by the substitution of an alanine
for a single amino acid residue. There was no obvious
position within each peptide that, when mutated,
would result in loss of IgE binding. Furthermore, there
was no consensus in the type of amino acid that, when
changed to alanine, would lead to loss of IgE binding.
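The alanine-substitution series described in this section can be generated mechanically from an epitope sequence and its starting position in the protein. The sketch below is illustrative: the function name and the mutation labels (for example S70A) are conventions chosen for this example, and skipping residues that are already alanine is an assumption, since the text does not say how such positions were treated.

```python
def alanine_scan(peptide, first_position):
    """Return (label, mutant) pairs with one residue at a time replaced by alanine.

    first_position is the position in the full protein of the peptide's
    first residue. Residues that are already alanine are skipped.
    """
    variants = []
    for index, residue in enumerate(peptide):
        if residue == "A":
            continue
        mutant = peptide[:index] + "A" + peptide[index + 1:]
        label = f"{residue}{first_position + index}A"
        variants.append((label, mutant))
    return variants


if __name__ == "__main__":
    # Epitope 7 from Table III, Ara h 2 residues 65-74.
    for label, mutant in alanine_scan("SQDPYSPSPY", 65):
        print(label, mutant)
```

For epitope 7 this produces ten single-substitution peptides, including D67A, P68A, and Y69A (the positions where binding was lost or drastically reduced) and S70A (where binding increased).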

DISCUSSION 

Peanuts are one of the most common food allergens 
in both children and adults. In addition, peanut hyper- 
sensitivity is less likely to resolve spontaneously and 
more likely to result in fatal anaphylaxis. Because of 
the significance of the allergic reaction and the widen- 
ing use of peanuts as protein extenders in processed 
food, the risk to the peanut-sensitive individual is in- 
creasing. 

Various studies over the last several years have examined
the nature and location of the multiple allergens
in peanuts (8-10). Taylor et al. (9) demonstrated
that the allergenic portion of peanuts was in the protein
portion of the cotyledon. Our laboratory recently
identified two major allergens from peanut extracts,
designated Ara h 1 (11) and Ara h 2 (12). Greater than
90% of our patients who were challenge positive to peanut
had specific IgE to these proteins. The Ara h 1
allergen has been identified as a seed storage protein
with sequence homology to the vicilins (13), a family of
proteins commonly found in many higher plants (27,
28). The Ara h 2 nucleotide sequence identified in this
report has sequence homology with another class of
seed storage proteins called conglutins (20). It is interesting
to note that two of the major peanut allergens
thus far identified are seed storage proteins that have
sequence homology with proteins in other plants. This
may explain the cross-reacting antibodies to other legumes
that are found in the sera of patients that manifest
clinical symptoms to only one member of the legume
family (29).

There are at least 10 IgE recognition sites distributed 
throughout the major peanut allergen Ara h 2. The
identification of multiple epitopes on a single allergen 
is not novel, there being reports of multiple IgE binding 
epitopes on allergens from many foods that cause im- 
mediate hypersensitivity reactions (30-36). The obser- 
vation that most of these proteins have multiple IgE- 
binding sites probably reflects the polyclonal nature of 
the immune response to them and may be a necessary 
step in establishing a protein as an allergen. 

Recent evidence suggests that there is a preferential
variable heavy chain usage in IgE synthesis and a direct
switching from IgM production to IgE synthesis
(37). This would suggest that epitopes responsible for
antigen-specific IgE antibody production may differ
from those promoting antigen-specific IgG antibodies
and that there may be some structural similarity between
peptides that elicit IgE antibody production.
However, there was no obvious sequence motif that was
shared by the 23 different IgE-binding epitopes of the
peanut allergen Ara h 1 (13). In the present study, two
epitopes in Ara h 2 share a hexameric peptide
(DPYSPS). It is significant to note that these peptides
are recognized by serum IgE from all the peanut-hypersensitive
patients tested in this study. In addition, serum
IgE that recognizes these peptides represents the
majority of Ara h 2-specific IgE found in these patients.
Whether there is any further structural similarity between
the IgE-binding epitopes of Ara h 2 remains to
be determined.

The elucidation of the major IgE binding epitopes on 
Ara h 2 may enable us to design better therapeutic 
options for the prevention of anaphylaxis as a result of 
peanut hypersensitivity. The only therapeutic option 
presently available for the prevention of a food hypersensitivity
reaction is food avoidance. Unfortunately, for
a ubiquitous food such as peanut, the possibility of an 



(A) A representative SPOTS membrane containing the wild-type (WT) peptide 7 and the mutated forms in which the amino acid at each
position was substituted with alanine. (B) The relative intensity of IgE binding to the mutated forms of each peptide was then determined
by densitometry and expressed as a percentage of IgE binding to the wild-type peptide. The letters and numbers across the top of (A) and
across the bottom of each panel in (B) indicate the one-letter amino acid code for the residue normally at that position, the position of each
residue in the Ara h 2 protein, and the amino acid that was substituted at that position. WT indicates the wild-type peptide (no amino
acid substitutions).




STANLEY ET AL. 



inadvertent ingestion is great. One therapeutic option 
used extensively for patients with allergic reactions to 
various aeroallergens and insect sting venoms is aller- 
gen desensitization immunotherapy. Allergen immu- 
notherapy consists of injections of increasing amounts 
of allergens to which a patient has Type I immediate 
hypersensitivity (38, 39). While the absolute mecha- 
nism of immunotherapy is unknown, an increase in 
IgG or IgG4 antibody activity, a decrease in allergen- 
specific IgE levels, and a decrease in basophil activity 
have all been implicated in mediating this response 
(40-43). Because allergen immunotherapy has been 
proven efficacious for treatment of some allergies, 
treatment with peanut immunotherapy is now being 
studied as a possible option (44). Our work identifying
the IgE-binding epitopes of a major peanut allergen
may allow for the use of immunodominant epitopes
in this approach.

Finally, our data show that it may be possible to 
mutate the Ara h 2 allergen to a protein that no longer 
binds IgE. This raises the possibility that an altered 
Ara h 2 gene could be used to replace its allergenic
homologue in the peanut genome. In this manner, a 
hypoallergenic peanut could be developed that may 
blunt allergic reactions in sensitive individuals who 
inadvertently ingest this food. Such an approach would 
require that the altered Ara h 2 gene product retain 
its native function and properties. Given the potential 
severity of peanut allergic reactions and the wide- 
spread use of peanuts in consumer foods, this possibil- 
ity is actively being pursued in our laboratories. 

ACKNOWLEDGMENTS 

This work was supported by grants from the Arkansas Science
and Technology Authority, National Institutes of Health Grant
R01-AI33596, and the Clarissa Sosin Allergy Research Foundation.

REFERENCES 

1. Burks, A. W., and Sampson, H. A. (1993) Curr. Prob. Pediatr.
23, 230-252.

2. Jansen, J. J. N., Kardinaal, A. F. M., Huijbers, G. H.,
Vlieg-Boerstra, B. J., Martens, B. P. M., and Ockhuizen, T. (1994)
J. Allergy Clin. Immunol. 93, 446-456.

3. Yunginger, J. W., and Jones, R. T. (1987) A review of peanut
chemistry: Implications for the standardization of peanut extracts,
in Regulatory Control and Standardization of Allergenic
Extracts: Proceedings of the Fourth International Paul Ehrlich
Seminar (Schaeffer, M., Sisk, C., and Brede, H. D., Eds.), pp.
251-264, Washington, DC.

4. Bock, S. A. (1985) J. Pediatr. 107, 676-680.

5. Sampson, H. A., Mendelson, I., and Rosen, J. P. (1992) N. Engl.
J. Med. 327, 380-384.

6. Yunginger, J. W., Sweeney, K. G., Sweeney, W. Q., Giannandrea,
L. A., Teigland, J. D., Bray, M., Benson, P. A., York, J. A.,
Biedrzycki, L., Squillace, D. L., and Helm, R. M. (1988) J. Am.
Med. Assoc. 260, 1450-1452.

7. Yunginger, J. W., Squillace, D. L., Jones, R. T., and Helm, R. M.
(1989) Allergy Proc. 10, 249-253.



8. Bush, R. K., Taylor, S. L., and Nordlee, J. A. (1989) Allergy Proc.
10, 261-264.

9. Taylor, S. L., Busse, W. W., Sachs, M. I., Parker, J. L., and
Yunginger, J. W. (1981) J. Allergy Clin. Immunol. 68, 373-375.

10. Barnett, D., Baldo, B. A., and Howden, E. H. (1983) J. Allergy
Clin. Immunol. 72, 61-68.

11. Burks, A. W., Williams, L. W., Helm, R. M., Connaughton, C.,
Cockrell, G., and O'Brien, T. J. (1991) J. Allergy Clin. Immunol.
88, 172-179.

12. Burks, A. W., Williams, L. W., Connaughton, C., Cockrell, G.,
O'Brien, T. J., and Helm, R. M. (1992) J. Allergy Clin. Immunol.
90, 962-969.

13. Burks, A. W., Cockrell, G., Stanley, J. S., Helm, R. M., and
Bannon, G. A. (1996) J. Clin. Invest. 96, 1715-1721.

14. Larsen, J. N., Stroman, P., and Ipsen, H. (1992) Mol. Immunol.
29, 703-711.

15. Bannon, G. A., Calzone, F. J., Bowen, J. K., Allis, C. D., and
Gorovsky, M. A. (1983) Nucleic Acids Res. 11, 3903-3917.

16. Needleman, S. B., and Wunsch, C. D. (1970) J. Mol. Biol. 48(3),
443-453.

17. Watson, C. J., and Jackson, R. F. (1985) DNA Cloning (D. M.
Glover, Ed.), Vol. I, pp. 79-88, IRL Press.

18. Huynh, T. Y., Young, R. A., and Davies, R. W. (1985) in DNA
Cloning, A Practical Approach (Glover, D. M., Ed.), Vol. I, pp.
49-78, Oxford Press.

19. Sanger, F., Nicklen, S., and Coulson, A. R. (1977) Proc. Natl.
Acad. Sci. USA 74, 5463-5467.

20. Gayler, K. R., Kolivas, S., Macfarlane, A. J., Lilley, G. G., Baldi,
M., Blagrove, R. J., and Johnson, E. D. (1990) Plant Mol. Biol.
15, 879-893.

21. Nirasawa, S., Nishino, T., Katahira, M., Uesugi, S., Hu, Z., and
Kurihara, Y. (1994) Eur. J. Biochem. 223, 989-995.

22. Garcia-Maroto, F., Marana, C., Mena, M., Garcia-Olmedo, F.,
and Carbonero, P. (1990) Plant Mol. Biol. 14, 845-853.

23. Joudrier, P., Lullien-Pellerin, V., Alary, R., Grosset, J., Guirao,
A., and Gautier, M. F. (1995) DNA Seq. 5, 153-162.

24. Sanchez-Monge, R., Gomez, L., Barber, D., Lopez-Otin, C.,
Armentia, A., and Salcedo, G. (1992) Biochem. J. 281, 401-405.

25. Armentia, A., Sanchez-Monge, R., Gomez, L., Barber, D., and
Salcedo, G. (1993) Clin. Exp. Allergy 23, 410-415.

26. James, J. M., Sixbey, J. P., Helm, R. M., Bannon, G. A., and
Burks, A. W. (1996) J. Allergy Clin. Immunol., in press.

27. Gibbs, P. E., Strongin, K. B., and McPherson, A. (1989) Mol. Biol.
Evol. 6, 614-623.

28. Chee, P. P., and Slightom, J. L. (1991) Sub-Cell. Biochem. 17,
31-52.

29. Bernhisel-Broadbent, J., Taylor, S., and Sampson, H. A. (1989)
J. Allergy Clin. Immunol. 84, 701-709.

30. van Kampen, V., Becker, W. M., Chen, Z., Rihs, H. P., Mazur,
G., Raulf, M., Liebers, V., Isringhausen-Bley, S., and Baur, X.
(1994) Mol. Immunol. 31, 1133-1140.

31. Breiteneder, H., Ferreira, F., Reikerstorfer, A., Duchene, M.,
Valenta, R., Hoffman-Sommergruber, K., Ebner, C., Breitenbach,
M., Kraft, D., and Scheiner, O. (1992) J. Allergy Clin.
Immunol. 90, 909-917.

32. Ball, G., Shelton, M. J., Walsh, B. J., Hill, D. J., Hoskins, C. S.,
and Howden, M. E. (1994) Clin. Exp. Allergy 24, 758-764.

33. Aas, K., and Elsayed, S. (1975) Dev. Biol. Stand. 29, 90-98.

34. Elsayed, S., Holen, E., and Dybendal, T. (1989) Int. Arch. Allergy
Appl. Immunol. 89, 410-415.

35. Herian, A. M., Taylor, S. L., and Bush, R. K. (1990) Int. Arch.
Allergy Appl. Immunol. 92, 193-198.



IgE-ALLERGEN INTERACTIONS OF A PEANUT PROTEIN




36. Shanti, K. N., Martin, B. M., Nagpal, S., Metcalf, D. D., and Rao,
P. V. J. Immunol. 151, 5354-5363.

37. van der Stoep, N., Korvere, W., and Logtenberg, T. (1994) Eur.
J. Immunol. 24, 1307-1311.

38. Reisman, R. E. (1994) Allergy Proc. 15, 61-63.

39. Fitzsimons, T., and Grammer, L. C. (1990) Allergy Proc. 11, 156.

40. Sparholt, S. H., Olsen, O. T., and Schou, C. (1992) Clin. Exp.
Allergy 22, 648-653.

41. Gieni, R. S., Yang, X., and Hayglass, K. T. (1993) J. Immunol.
150, 302-310.

42. Secrist, H., Chelen, C. J., Wen, Y., Marshall, J. D., and Umetsu,
D. T. (1993) J. Exp. Med. 178, 2123-2130.

43. Garcia, N. M., Lynch, M. C., Di Prisco, M. C., and Lopez, R. I.
(1995) J. Invest. Allergol. Clin. Immunol. 5, 18-24.

44. Oppenheimer, J. J., Nelson, H. S., Bock, S. A., Christensen, F.,
and Leung, D. Y. (1992) J. Allergy Clin. Immunol. 90, 151-152.



STIC-ILL

From: Huynh, Phuong N.
Sent: Tuesday, September 21, 2004 4:59 PM
To: STIC-ILL
Subject: RE: 09/494,096 rush
Importance: High

Please deliver:

J. Allergy Clin. Immunol. 109:143-149, 2002.
J. Allergy Clin. Immunol. 105:S184 (Abstract 555), 2000.
J. Allergy Clin. Immunol. 105:378-384, 2000.
J. Allergy Clin. Immunol. 105:S140 (Abstract 423), 2000.
Ann. NY Acad. Sci. 964:69-85, 2002.
Ferreira et al., Adv. Exp. Med. Biol. 409:127-135, 1996.
Fasler et al., J. Allergy and Clinical Immunology 101(4 pt 1):521-30, April 1998.
Burks et al., Eur. J. Biochem. 245:334-339, April 1997.
Stanley et al., Archives of Biochemistry and Biophysics 342(2):244-253, June 1997.
Skolnick et al., Trends in Biotech. 18(1):34-39, Jan 2000.
Colman et al., A structural view of immune recognition by antibodies, pages 33-36, 1994.
Evens et al., Therapeutic Drug Monitoring 15:514-520, 1993.

Thanks,
Neon

Art unit 1644
Room REM 3D75
Tel (571) 272-0846



Review 



From genes to protein structure and function: 
novel applications of computational 
approaches in the genomic era 

Jeffrey Skolnick and Jacquelyn S. Fetrow 

The genome-sequencing projects are providing a detailed 'parts list' of life. A key to comprehending this list is understanding
the function of each gene and each protein at various levels. Sequence-based methods for function prediction are inadequate
because of the multifunctional nature of proteins. However, just knowing the structure of the protein is also insufficient for
prediction of multiple functional sites. Structural descriptors for protein functional sites are crucial for unlocking the secrets
in both the sequence and structural-genomics projects.



Genome-sequencing projects are providing a 
detailed 'parts list' for life. Unfortunately, this list, 
a portion of which represents the amino acid 
sequence of all the proteins in a given genome, does
not come with an instruction manual. That is, given 
the genome's sequences, one does not necessarily know 
straight away which regions encode proteins, which 
serve a regulatory role and which are responsible for 
the structure and replication of the DNA itself.

This is not unlike giving a child a list of parts necessary
to create a working automobile. Without the necessary
expertise, creating the final, working car from just the
initial parts list is a nearly impossible task. Similarly,
understanding how to create a complete, functioning cell
given just the sequence of nucleotides found in an
organism's genome is a complex problem.

What is a protein function? 

After a genome is sequenced and its complete parts
list determined, the next goal is to understand the
function(s) of each part, including that of the proteins. What
do we mean by protein function, the focus of this article?

Function has many meanings. At one level, the pro- 
tein could be a globular protein, such as an enzyme, 
hormone or antibody, or it could be a structural or 
membrane-bound protein. Another level is its bio- 
chemical function, such as the chemical reaction and 
the substrate specificity of an enzyme. The regulatory 
molecules or cofactors that bind to a protein are also 
levels of biochemical function. 

At the cellular level, the protein's function would
involve its interaction with other macromolecules and
the function and cellular location of such complexes.
There is also the protein's physiological function; that
is, in which metabolic pathway the protein is involved
or what physiological role it performs in the organism.
Finally, the phenotypic function is the role played by
the protein in the total organism, which is observed by
deleting or mutating the gene encoding the protein.

J. Skolnick (skolnick@danforthcenter.org) is at the Danforth Plant Science
Center, Laboratory of Computational Genomics, 4041 Forest Park
Avenue, St Louis, MO 63108, USA. J.S. Fetrow is at GeneFormatics,
Suite 200, 5830 Oberlin Drive, San Diego, CA 92121-3154, USA.



Obviously, the complete characterization of protein
function is difficult, but efforts are under way at all levels,
including cellular function. In this article, however,
we focus on identifying the biochemical function of a
protein given its sequence, a problem that is amenable to
molecular approaches.

Sequence-based approaches to function 
prediction 

The sequence-to-function approach is the most commonly
used function-prediction method. This robust field is well
developed and, in the interest of space, we will merely
present a brief overview. There are two main flavors of this
approach: sequence alignment and sequence-motif methods
such as Prosite, Blocks, Prints and Emotif. Both the
alignment and the motif methods are powerful but a
recent analysis has demonstrated their significant
limitations, suggesting that these methods will increasingly
fail as the protein-sequence databases become more
diverse.

An extension of these approaches that combines
protein-sequence with structural information has been
developed and some successes have been reported.
However, this method still applies the structural
information in a one-dimensional, 'sequence-like' fashion
and fails to take into account the powerful
three-dimensional information displayed by protein structures.

In addition, proteins can gain and lose function during
evolution and may, indeed, have multiple functions
in the cell (Box 1). Sequence-to-function methods
cannot specifically identify these complexities. Inaccurate
use of sequence-to-function methods has led to
significant function-annotation errors in the sequence
databases.

An alternative approach 

An alternative, complementary approach to
protein-function prediction uses the
sequence-to-structure-to-function paradigm. Here, the goal is to determine the
structure of the protein of interest and then to identify
the functionally important residues in that structure.
Using the chemical structure itself to identify functional
sites is more in line with how the protein actually works.
In a sense, this is one long-term goal of 'structural
genomics' projects, which are designed to determine
all possible protein folds experimentally, just
as genome-sequencing projects are determining all
protein sequences. This is in contrast to traditional
structural-biology approaches, in which one knows the
protein's function first and only then, if the function is
sufficiently important, determines its structure.

It is implicitly assumed that having the protein's structure
will provide insights into its function, thereby furthering
the goals of the human-genome-sequencing
project. However, knowing a protein's three-dimensional
structure is insufficient to determine its function
(Box 2). What we really need to analyse and predict the
multifunctional aspects of proteins is a method that
specifically recognizes active sites and binding regions in
these protein structures.

Active-site identification

In order to use a structure-based approach to function
prediction, one must identify the key residues responsible
for a given biochemical activity. For many years,
it has been suggested that the active sites in proteins are
better conserved than the overall fold. Taken to the
limit, this suggests that one could identify not only distant
ancestors with the same global fold and the same
activity but also proteins with similar functions but
distantly related, or possibly unrelated, global folds.

The validity of this suggestion was demonstrated
empirically by Nussinov and co-workers, who showed
that the active sites of eukaryotic serine proteases,
subtilisins and sulfhydryl proteases exhibit similar structural
motifs. Furthermore, in a recent modeling study of
Saccharomyces cerevisiae proteins, protein functional sites
were found to be more conserved than other parts of
the protein models. Similarly, it has been demonstrated
that the catalytic triad of the α/β hydrolases
is structurally better conserved than other
histidine-containing triads. A comparison of the structure of the
hydrolase catalytic triad to other histidine-containing
triads shows a distinct bimodal distribution, while a
similar analysis done with a randomly selected triad shows
a unimodal distribution (Fig. 1).

Kasuya and Thornton generalized this example by
creating structural analogs of a few Prosite sequence
motifs. For the 20 most-frequently occurring Prosite
patterns, the associated local structure is quite distinct.
These results provide clear evidence that enzyme active
sites are indeed more highly conserved than other parts
of the protein.

Identifying active sites in experimental structures 

Historically, several groups have attempted to identify
functional sites in proteins; these efforts were
directed at protein engineering or building functional
sites in places where they did not previously exist. This
has been successfully accomplished for several
metal-binding sites. However, highly accurate functional-site
descriptors of the backbone and side-chain atoms were
required, fueling the belief that significant atomic detail
is required in site descriptors for function identification.

Highly detailed residue side-chain descriptors of the
active sites of serine proteases and related proteins have
been used to identify functional sites. The use of these
highly detailed motifs has led to the identification of


Box 1. Proteins are multifunctional 



A common protein characteristic that makes functional analysis based 
only on homology especially difficult is the tendency of proteins to be 
multifunctional. For instance, lactate dehydrogenase binds NAD, sub- 
strate and zinc, and performs a redox reaction. Each of these occurs 
at different functional sites that are in close proximity and the combi- 
nation of all four sites creates the fully functional protein. 

Other examples of multifunctional proteins are the nucleic-acid-binding
proteins. For instance, DNA regulatory proteins often contain a
DNA-binding domain, a multimerization domain and additional sites that bind
regulatory proteins; a classic example is RecA. The 3C rhinovirus
protease exhibits a proteolytic function as well as an RNA-binding
function (Refs 60,61). Transcription factors are also complex, multifunctional
proteins (Ref. 62). It is becoming increasingly important to recognize each of
these different functions of gene products of a newly sequenced gene.

The serine-threonine-phosphatase superfamily is a prime example of
the difficulties of using standard sequence analysis to recognize the
multiple functions found in single proteins. This large protein family is
divided into a number of subfamilies, all of which contain an essential
phosphatase active site. Subfamilies 1, 2A and 2B exhibit 40% or more
sequence identity between them. However, each of these subfamilies
is apparently regulated differently in the cell, and this observation
suggests that there are different functional sites at which regulation can
occur. Because the sequence identity between subfamilies is so high,
standard sequence-similarity methods could easily misclassify new
sequences as members of the wrong subfamily if the functional sites
are not carefully considered, as was recently demonstrated.

These are but a few examples of the multifunctionality of proteins. 
The recognition of this multifunctional nature is of critical importance 
to the genomics field. Useful functional-annotation methods must con- 
sider all of the specific functions in a given protein and will not just 
provide a general classification of function. 



several novel functional sites in known, high-quality
protein structures. More automated methods for
finding spatial motifs in protein structures have also
been described.

Unfortunately, most of these methods require the
exact placement of atoms within protein backbones and
side chains, and so have not been shown to be relevant
to inexact predicted structures. Recently, however, we
described the production of fuzzy, inexact descriptors
of protein functional sites. As we wish to apply the
descriptors to experimental structures as well as to
predicted protein models, we used only α-carbon atoms and
side-chain center-of-mass positions. We call these
descriptors 'fuzzy functional forms' (FFFs) and have
created them for both the disulfide-oxidoreductase
and α/β-hydrolase catalytic active sites.
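
As a rough illustration of what a fuzzy, distance-based site descriptor
might look like, the sketch below checks whether a set of residue
positions satisfies a few target distances, each within a tolerance. The
residue labels, target distances and tolerances are hypothetical values
chosen for illustration and are not the published FFF parameters;
`matches_descriptor` is an invented helper name.

```python
import math


def distance(p, q):
    """Euclidean distance between two (x, y, z) points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def matches_descriptor(positions, descriptor):
    """Return True if every pairwise distance is within its tolerance.

    positions:  dict mapping a residue label to an (x, y, z) position,
                e.g. an alpha-carbon or side-chain center of mass.
    descriptor: dict mapping (label_a, label_b) to (target, tolerance),
                both in angstroms. All values here are hypothetical.
    """
    for (a, b), (target, tolerance) in descriptor.items():
        if abs(distance(positions[a], positions[b]) - target) > tolerance:
            return False
    return True


# Hypothetical three-residue descriptor: two cysteines and a proline.
EXAMPLE_DESCRIPTOR = {
    ("cys1", "cys2"): (5.0, 1.0),
    ("cys1", "pro"): (8.0, 1.5),
}
```

Because each distance only has to fall inside a window, the same
descriptor can in principle be applied to a crystal structure and to a
cruder predicted model.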

The disulfide-oxidoreductase FFF was applied to
screen high-resolution structures from the Brookhaven
protein database. In a dataset of 364 protein structures,
the FFF accurately identified all proteins known to
exhibit the disulfide-oxidoreductase active site. In a
larger dataset of 1501 proteins, the FFF again accurately
identified all proteins with the active site. In addition,
it identified another protein, 1fjm, a serine-threonine
phosphatase. This result was initially discouraging but
subsequent sequence alignment and clustering analysis
strongly suggested that this putative site might indeed
be a site of redox regulation in the serine-threonine
phosphatase-1 subfamily. If confirmed by experiment,
this result will highlight the advantages of using
structural descriptors to analyse multiple functional sites in
proteins. It will also highlight the fact that human

Box 2. Knowing a protein's structure does not necessarily
tell you its function



Because proteins can have similar folds but different functions,
determining the structure of a protein may or may not tell you
something about its function. The most well-studied example is the (α/β)8
barrel enzymes, of which triose-phosphate isomerase (TIM) is the
archetypal representative. Members of this family have similar overall
structures but different functions, including different active sites, substrate
specificities and cofactor requirements.

Is this example common? Our own analysis of the 1997 SCOP
database shows that the five largest fold families are the
ferredoxin-like, the (α/β) barrels, the knottins, the immunoglobulin-like and the
flavodoxin-like fold families with 22, 18, 13, 9 and 9 subfamilies,
respectively (Fig. i). In fact, 57 of the SCOP fold families consist of multiple
superfamilies. These data only show the tip of the iceberg, because
each superfamily is further composed of protein families and each
individual family can have radically different functions. For example, the
ferredoxin-like superfamily contains families identified as Fe-S ferredoxins,
ribosomal proteins, DNA-binding proteins and phosphatases, among
others.

After this article was submitted, a much-more-detailed analysis of the
SCOP database was published. This finds a broad function-structure
correlation for some structural classes, but also finds a number of
ubiquitous functions and structures that occur across a number of
families. The article provides a useful analysis of the confidence with which
structure and function can be correlated. Knowing the protein
structure by itself is insufficient to annotate a number of functional classes
and is also insufficient for annotating the specific details of protein
function.

Figure i

Histogram of the numbers of superfamilies found in each SCOP fold family.
These data clearly show that proteins with similar structures can have different
functions and demonstrate the difficulty of assigning protein function based
simply on the three-dimensional structure. The data were taken from the 1997
distribution of SCOP (http://scop.mrc-lmb.cam.ac.uk/scop). For a more-detailed
analysis, see Ref. 72.



observation alone is no longer adequate for identifying 
all functional sites in known protein structures. 

To date, the use of structure to identify function has
largely focused on high-resolution structures and highly
detailed descriptors of protein functional sites.
However, the creation of inexact descriptors for functional
sites opens the way to the application of these methods
to inexact, predicted protein models. The question
remains: how good does a model have to be in order
to use FFFs to identify its active sites?



The state of the art in structure-prediction 
methods 

For proteins whose sequence identity to a protein of
known structure is above ~30%, one can use homology
modeling to build the structure. However, structure
prediction is far more difficult for proteins that are not
homologous to proteins with known structure. At present,
there are two approaches for these sequences: ab initio
folding and threading.

In ab initio folding, one starts from a random
conformation and then attempts to assemble the native
structure. As this method does not rely on a library of
pre-existing folds, it can be used to predict novel
folds. The recent CASP3 protein-structure-prediction
experiment (http://PredictionCenter.llnl.gov/CASP3)
involved the blind prediction of the structure of
proteins whose actual structure was about to be
experimentally determined. These results indicate that
considerable progress has been made. For helical and
α/β proteins with less than 110 residues, structures
were often predicted whose backbone
root-mean-square deviation (RMSD) from native ranged from
4-7 Å. Progress is being made with the β proteins, too,
although they remain problematic. Because ab initio
methods can identify novel folds, these methods could
be used to help to select sequences likely to yield novel
folds in experimental structural-genomics projects.
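
For readers unfamiliar with the RMSD figures quoted above, the following
is a minimal sketch of how a backbone RMSD can be computed from two sets
of atomic coordinates. It assumes the two structures have already been
optimally superimposed and that atoms are listed in the same order; the
superposition step itself is not shown, and `backbone_rmsd` is a name
introduced here for illustration.

```python
import math


def backbone_rmsd(model, native):
    """Root-mean-square deviation between two coordinate sets.

    model, native: equal-length lists of (x, y, z) tuples, one per
    backbone atom, assumed to be already superimposed. The result is
    in the same units as the input (e.g. angstroms).
    """
    if len(model) != len(native) or not model:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = sum(
        (a - b) ** 2
        for p, q in zip(model, native)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared_sum / len(model))
```

A value of 0 means the two sets coincide; the 4-7 Å range mentioned in
the text corresponds to models that capture the overall fold but not
atomic detail.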

Another approach to tertiary-structure prediction is
threading. Here, for the sequence of interest, one
attempts to find the closest matching structure in a
library of known folds. Threading is applicable to
proteins of up to 500 residues or so and is much faster
than ab initio approaches. However, threading cannot
be used to obtain novel folds.

Ab initio predicted models can be used for automatic
protein-function prediction

The results of the recent CASP3 competition
suggest that current modeling methods can often (but not
always) create inexact protein models. Are these
structures useful for identifying functional sites in proteins?
Using the ab initio structure-prediction program
MONSSTER, the tertiary structure of a glutaredoxin,
1ego, was predicted. For the lowest-energy model,
the overall backbone RMSD from the crystal structure
was 5.7 Å.

To determine whether this inexact model could be
used for function identification, the sets of correctly
and incorrectly folded structures were screened with
the FFF for disulfide-oxidoreductase activity. The
FFF uniquely identified the active site in the correctly
folded structure but not in the incorrectly folded ones
(Fig. 2). This is a proof-of-principle demonstration that
inexact models produced by ab initio prediction of
structure from sequence can be used for the subsequent
prediction of biochemical function. Of course,
improvements in the method have to be made before such
predictions can be done on a routine basis.

Use of predicted structures from threading in
protein-function prediction

At present, practical limitations preclude folding an
entire genome of proteins using ab initio methods.
Threading is more appropriate for achieving the requisite
high-throughput structure prediction. Thus, a
standard threading algorithm has been used to screen all
proteins in nine genomes for the disulfide-oxidoreductase
active site described above.
First, sequences that aligned with the structures of
known disulfide oxidoreductases were identified. Then,
the structure was searched for matches to the
active-site residues and geometry. For those sequences for
which other homologs were available, a
sequence-conservation profile was constructed. If the putative
active-site residues were not conserved in the sequence
subfamily to which the protein belongs, that sequence
was eliminated. Otherwise, the sequence was predicted
to have the function.
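
The three filtering steps just described can be sketched as a simple
screening loop. This is only an illustrative outline, not the authors'
implementation: the candidate record fields, the `predict_function` name
and the 0.8 conservation cutoff are all assumptions made for the example,
and the threading alignment and geometry match are represented by
precomputed flags.

```python
def predict_function(candidates, min_conservation=0.8):
    """Return the ids of sequences predicted to carry the active site.

    candidates: list of dicts with hypothetical keys
      'id'                   sequence identifier
      'aligns_to_known_fold' True if threading aligned it to a known structure
      'site_geometry_match'  True if the aligned residues match the descriptor
      'site_conservation'    fraction of homologs conserving the site residues,
                             or None when no homologs are available
    min_conservation: assumed cutoff below which a sequence is eliminated.
    """
    predicted = []
    for candidate in candidates:
        # Step 1: the sequence must align to a known structure.
        if not candidate["aligns_to_known_fold"]:
            continue
        # Step 2: the aligned structure must match the site geometry.
        if not candidate["site_geometry_match"]:
            continue
        # Step 3: apply the conservation filter only when homologs exist.
        conservation = candidate.get("site_conservation")
        if conservation is not None and conservation < min_conservation:
            continue
        predicted.append(candidate["id"])
    return predicted
```

Note that a sequence with no available homologs skips the conservation
filter, which mirrors the wording of the text: the profile is built only
"for those sequences for which other homologs were available".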

Using this sequence-to-structure-to-function method,
99% of the proteins in the nine genomes that have
known disulfide-oxidoreductase activity have been
found. From 10% to 30% more functional predictions
are made than by alternative sequence-based approaches;
similar results are seen for the α/β hydrolases.
Surprisingly, in spite of the fact that threading algorithms
have problems generating good sequence-to-structure
alignments, active sites are often accurately aligned,
even for very distant matches. This observation would
agree with the above experimental results indicating
that active sites are well conserved in protein structures.

Importantly, the false-positive rate when using
structural information is much lower than that found using
sequence-based approaches, as demonstrated by a
detailed comparison of the FFF structural approach and
the Blocks sequence-motif approach (N. Siew et al.,
unpublished). In this study, the sequences in eight
genomes, including Bacillus subtilis, were analysed for
disulfide-oxidoreductase function using the
disulfide-oxidoreductase FFF, the thioredoxin Block 00194 and
the glutaredoxin Block 00195. If we assume that those
sequences identified by both the FFF and Blocks
are 'true positives', we find 13 such sequences in the
B. subtilis genome.

There is no experimental evidence validating all of 
these 'true positives' and so they are more accurately 
termed 'consensus positives'. In order to find these 13 
'consensus positive' sequences, the FFF hits seven false 
positives. On the other hand, Blocks hits 23 false 
positives (Fig. 3). It was previously suggested that the 
use of a functional requirement adds information to 
threading and reduces the number of false positives.
These data, including the data shown in Fig. 3, validate 
this claim on a genome-wide basis. 
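
To make the comparison concrete, the counts quoted above (13 consensus
positives, with 7 false positives for the FFF and 23 for Blocks) can be
turned into the fraction of hits that are consensus positives. This is a
small worked calculation using only the numbers in the text; the
`precision` helper name is ours, and "precision" here is relative to the
consensus set rather than to experimentally confirmed function.

```python
def precision(consensus_positives, false_positives):
    """Fraction of all hits that are consensus positives."""
    return consensus_positives / (consensus_positives + false_positives)


# Counts for the B. subtilis genome, taken from the text.
fff_precision = precision(13, 7)      # 13 of 20 hits
blocks_precision = precision(13, 23)  # 13 of 36 hits
```

On these counts roughly two-thirds of FFF hits are consensus positives,
against a little over one-third for Blocks.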

Of course, as no genome has had the function of all 
of its proteins experimentally annotated, it is imposs- 
ible to know how many other proteins with the speci- 
fied biochemical function were not properly identified. 
This is a critical question for researchers attempting to 
predict protein function. Experimental confirmation 
will be needed to validate this or any other method 
fully. This points out the need for closely coupling 
computational function-prediction algorithms with
experiments. 

Weaknesses of using the sequence-to-structure- 
to-function method of function prediction 

Based on studies to date, the identification of enzy- 
matic activity requires a model in which the backbone 
RMSD from native near the active sites is about 4-5 Å.
Predicted models are better at describing the geometry
in the core of the molecule than in the loops and so

Figure 1

The distribution of root-mean-square deviations (RMSD) between the hydrolase
catalytic triad and all other histidine-containing triads shows a bimodal distribution
(a); by contrast, the RMSD between a randomly selected (non-catalytic) triad and all
other histidine-containing triads has a unimodal distribution (b). The His-Ser-Asp
catalytic triad in the protein 1gpl (RP2 lipase) (a) and a random histidine-containing
triad from 4pga (glutaminase-asparaginase) (b) were structurally aligned to all
His-containing triads in a database of 1037 proteins (Ref. 23). Actual α/β-hydrolase active sites
(a) and the 4pga site (b) are indicated by blue bars; other histidine triads that are
not active sites are indicated by red bars. None of the sites found by matching to the
4pga were hydrolase active sites. Inset graphs show the full distribution.

predicting the function of a protein whose active site is
in loops may be a problem. Also, the method can
currently only be applied to enzyme active sites; substrate-
and ligand-binding sites have not been identified using
the inexact models. Techniques that will further refine
inexact protein models will be quite useful in taking
the protein analysis to the next step.

Conclusions 

Although sequence-based approaches to
protein-function prediction have proved to be very useful,
alternatives are needed to assign the biochemical function
of the 30-50% of proteins whose function cannot be
assigned by any current methods. One emerging
approach involves the sequence-to-structure-to-function
paradigm. Such structures might be provided by
structural-genomics projects or by structure-prediction
algorithms. Functional assignment is made by
screening the resulting structure against a library of structural
descriptors for known active sites or binding regions.

Figure 2

Application of the disulfide-oxidoreductase fuzzy functional form (FFF) to ab initio
models of glutaredoxin created by the program MONSSTER shows that the FFF can
distinguish between correctly folded and misfolded (or higher-energy) models. The FFF
is shown as two orange balls (representing the cysteines) and a blue ball
(representing the proline). The protein models are shown as magenta wire models with the
active-site cysteines and proline shown as yellow and cyan balls, respectively. The FFF clearly
distinguishes the correct active site in the crystal structure of the glutaredoxin 1ego
and the correctly folded, lowest-energy model. The FFF does not match to the active
sites of any of the higher energy, misfolded structures, four of which are shown here.

Figure 3

Analysis of the Bacillus subtilis genome using the thioredoxin Block 00194. The Blocks
score (computed using the publicly available BLIMPS program) is plotted on the x axis
and the number of sequences found in each scoring bin is plotted on the y axis. Those
sequences identified as 'consensus positives' [identified by both the fuzzy functional
form (FFF) and the Block] are shown as red bars. One additional sequence found by
the FFF, which is likely to be a true positive, is shown as a blue bar. All other
sequences, putative 'false positives', are shown as yellow bars. Using the Blocks
score at which all 13 of the 'consensus positives' are found, 23 false positives are
also found. In its analysis of the B. subtilis genome, the FFF identifies only seven false
positives along with the same 13 'consensus positives' (data not shown).



Detailed descriptors will only work on the
experimentally determined, high-quality structures. Ideally,
however, the descriptors should work on both
experimental structures and the cruder models provided by
tertiary-structure-prediction algorithms.

The advantages of such an approach are that one need
not establish an evolutionary relationship in order to
assign function, that more than one function can be
assigned to a given protein [an issue of major
importance, because proteins are multifunctional (Box 1)]
and, ultimately, that having a structure can provide
deeper insight into the biological mechanism of
protein function and regulation. The disadvantages are that
one needs to have the protein's structure before a
function can be assigned and that the approach is limited to
those functions associated with proteins with at least
one solved structure, so that a functional-site descriptor
can be constructed.

In this sense, structure-to-function assignment can be
thought of as 'functional threading': finding the
active-site match in a library of descriptors for known protein
active sites. This is the first step in the long process of
using structure to assign all levels of function, a goal
that is made increasingly important with the emergence
of structural genomics. Based on the progress to date,
it is apparent that structure will play an important role
in the post-genomic era of biology.

Abstract: Food allergies are IgE-mediated immunological reactions; this
distinguishes them from other adverse reactions to foods. Most (>90%) of
the recognized food allergies are generally thought to be caused by eight
foods or food groups. A number of factors can affect food allergy
development, including diet and culture, route of exposure, processing, cooking,
and digestion. In addition, it is thought that the properties of certain food
proteins render them more likely to be allergenic than other proteins.
Most food allergens are major proteins, polyvalent molecules with at least
two or more IgE-binding sites, and are recognized as foreign molecules
(hence immunogenic). A number of major food allergens have been
recently characterized, and amino acid sequences determined. Tropomyosin
is the only major allergen of shrimp. A number of IgE-binding epitopes
have been identified in this molecule, though they may vary from one
shrimp-allergic individual to another. Single amino acid substitutions
within epitopes based on that of homologous, nonreactive tropomyosins
can substantially enhance or abolish IgE antibody binding. Using the
accumulated knowledge of food allergen protein structure, the allergenicity
of novel proteins to which there has been no prior human exposure has
been assessed. This has been based primarily on the lability or resistance
of a protein to enzymatic degradation. Clearly, further criteria must be
developed to refine this process. In this regard, the development of animal
models that have been sufficiently validated as surrogates of human IgE
antibody responses is needed for more precise assessment of the allergenic
potential of proteins.

KEY WORDS: food allergens; metabolic food intolerance; gut-associated
lymphoid tissue; shrimp tropomyosin

FIGURE 1. Induction and effective mechanisms in IgE-mediated hypersensitivity:
An allergen that crosses mucosal membrane barriers is processed and presented to
IgE-producing cells (antigen presentation). This stimulates the production of IgE antibodies.
These antibodies have the ability to bind to mast cell surfaces (mast cell activation);
subsequent interaction with allergen can cross-link two or more membrane-bound IgE
molecules, which stimulates the release of preformed and newly synthesized mediators of
allergy. These mediators result in the clinical effects of allergic reactions.



INTRODUCTION 

It has been estimated that approximately 1 to 2% of the adult population
and 4 to 8% of the pediatric population suffer from food allergies. Most of
the recognized food allergies (more than 90%) are generally thought to be
caused by eight foods or food groups. These include Crustacea, eggs, fish,
milk, peanuts, soybeans, tree nuts, and wheat. The symptoms of food allergy
vary considerably from mild discomfort, such as itching of the skin or throat,
to profound, life-threatening, systemic anaphylactic shock. Food allergies are
IgE-mediated immunological reactions, which distinguishes them from other
adverse reactions to foods that may include metabolic food intolerances or
toxic food reactions.

The induction of allergic reactions in general, and food allergic reactions
specifically, is summarized in Figure 1. An allergen, or an immunologically
intact fragment of allergen resistant to cooking and digestive processes,
crosses the epithelial barrier of the gut. After processing and presentation by
antigen-presenting cells (generally macrophages), immunologically active
fragments are recognized by T lymphocytes, which along with other allergen
fragments stimulate B lymphocytes to synthesize and ultimately produce IgE
antibodies specific for that allergen of stimulation. IgE antibodies have the
unique ability to fix to the surfaces of mast cells or basophils. Upon a second
or subsequent interaction of allergen with cell-bound IgE molecules, events
are initiated in the membrane of these cells leading to the release of both
preformed and newly formed mediators. These mediators are molecules that have
potent pharmacological reactions resulting in the clinical effects that are seen
in allergic reactions. When one views the events leading to the induction
and manifestation of allergic reaction, there are clearly certain features
imposed on these allergenic molecules by their biological activity. For example,
food allergens must be rather hearty molecules to resist the processes of
cooking and digestion; they must be small enough to cross the epithelial barrier of
the gut, yet of sufficient size to remain in an immunologically intact form.
Finally, these molecules must be at least divalent and most probably polyvalent
to cross-link cell-bound IgE molecules resulting in allergic reactions.

There are a number of factors that can affect food allergy development.
Diet and culture can have a significant effect on the food allergies that
manifest themselves in a particular region. For example, codfish allergy is most
prevalent in Norway because codfish is an important source of protein for
Norwegians. Similarly, rice and soy allergy are more prevalent in Japan, and
peanut allergy is more prevalent in the United States. The route of exposure can
also have an important effect on food-induced allergic reactions. Clearly,
ingestion is a much harsher route of allergen exposure and thus may require
higher doses in order for a sufficient number of molecules or allergenically
active molecular fragments to survive in an immunologically intact form. However,
individuals can also be sensitized through inhalation of allergen, a less harsh
route in which lower doses of allergen can stimulate a response.^7 This occurs
with occupational sensitization of food workers.^8 The relationship of
inhalation to ingestion exposures of food allergen or the cross-reactivity of
food allergens with inhaled allergens in the pathogenesis of food allergy is only
now beginning to be understood.

Processing, cooking, and digestion obviously can have a profound effect
on foods. Because of the rather harsh treatments to which they are exposed,
food allergens are generally thought to be more resistant to these processes
than other antigens. This is not true of all food allergens, however; and,
clearly, food allergens have been identified that are not resistant to these
processes. Finally, one must consider intrinsic factors present in foods that
seem to affect food allergen development. Why are certain molecules apparently
much more allergenic than others? Is this only due to their resistance to
various cooking and digestive processes? Clearly, many nonallergenic molecules
are equally resistant to food processing methods or digestion, and thus it is
thought that there are other factors present in these food molecules that make
them more allergenic. Further studies are clearly needed to elucidate these
factors.

Thus, the investigation of food allergens presents unique challenges when
compared to studies of other allergens. Exposure to foods is more complicated
than that to other antigens in that most food antigen exposure occurs
through ingestion, a route that can significantly alter food molecules,
resulting in molecules or molecular fragments substantially different from those
that are identified in food extracts. It is clear that absorption through the gut
is needed to induce a food-allergic response, and thus the ability of molecules
to be absorbed through the gut and the role of gut-associated lymphoid tissue,
which may be different from that of the central immunological tissue, need to
be better understood. The effects on the induction of food allergies of
inhalation compared to ingestion of food allergens also need to be better
understood. Foods are generally complex, consisting of 10,000 to 20,000 or more
distinct protein molecules, and thus present more of a challenge for study.
Finally, it is very difficult, if not impossible, to assess the allergen dose
responsible for either sensitizing or provoking a food-allergic response or to
identify the molecular structure of active fragments. Some information is
available through food-challenge studies on the amount of food that provokes
allergic reactions, but unfortunately nothing is known about sensitizing doses.
This, again, is due to the complexity of the induction of food-allergic reactions.



PROPERTIES OF FOOD ALLERGENS 

Although the precise properties that render a food molecule allergenic are
unknown, some general characteristics of allergenic food proteins have been
identified. Generally, many major food allergens are major food proteins.
This suggests that higher doses of a particular allergen enhance the likelihood
of producing an allergic response. The fact that most food allergens are
usually expressed as percent of total food protein can confuse the issue of
exposure doses. For example, when percent protein is used as the measure, a
low-protein food such as potato may contain a protein such as patatin that is
50% of the potato's total protein yet is only a minor potato component
(1 g/100 g dry weight), whereas a protein present in a similar amount in a
high-protein food constitutes a much lower percentage (2-4%) of the total food
protein. Clearly, one must take into account the amount of a protein
consumed when considering exposure. Food allergens are molecules that must be
capable of stimulating immune responses; this is an absolute requirement
because molecules that are unable to stimulate antibody production cannot
induce an immediate hypersensitivity response. Generally, food allergens are
very stable molecules that resist the effects of food processing, cooking, and
digestive enzymes in the gut; intuitively, this makes sense in that molecules
that have such abilities would more likely be allergenic. This is not an
absolute requirement, however, and there are stable food molecules that are
not allergenic, while conversely, several food allergens have been identified
that are not highly resistant to these processes.

Food allergens must be polyvalent molecules with at least two IgE
antibody binding sites. This is because the mast cell or basophil-bound IgE
must be cross-linked by these allergen molecules in order to stimulate
mediator release. This is an absolute requirement, although one cannot preclude
monovalent molecules aggregating in vivo. Generally, most food allergens
are between 10 and 70 kDa and are primarily glycoproteins with acidic
isoelectric points; however, this is also the case for most antigens, and thus
it is not a distinguishing feature for food allergens.

The composition of allergenic foods in comparison to nonallergenic foods
has always been of interest. The aim, of course, is to be able to identify
differences that may be used to identify foods more likely to be allergenic.
Comparison of the composition of common allergenic and nonallergenic foods in
our diet is summarized in Table 1. In general, high-protein foods are more
allergenic; of the 23 foods that contain >20% protein, 14 (61%) have been
identified as important food allergens. Of the foods that contain <20% protein,
only 5/15 (33%) are important allergenic foods. It is interesting to note
that of the eight allergenic foods with somewhat lower protein concentration
(<30%), 5/8 (63%) have high lipid concentrations (>50%) in comparison to
other components; could elevated lipid levels enhance the allergenicity of
these proteins? Interestingly, almost all low-protein foods (<20%) have very
high concentrations of carbohydrates, with the exception of the tree nuts. Of
the 15 foods with <20% protein concentration listed in this table (omitting
tree nuts, which have high lipid concentrations), only one (wheat) is considered
a major food allergen, and the remaining fruits and vegetables have some
allergenic activity but are not major food allergens.
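
The percentages used in this comparison can be recomputed from the gram values
of Table 1. The short Python sketch below is illustrative only: it assumes that
"dry weight" is approximated by the sum of protein, fat, and carbohydrate
(ignoring ash and other minor components), and the function names are
hypothetical rather than part of any published method.

```python
# Sketch: recompute the percent columns of Table 1 from the gram values.
# Assumption: dry weight ~ protein + fat + carbohydrate (ash ignored).

SERVING_G = 85.05  # 3 oz serving, in grams


def percent_dry_weight(protein_g, fat_g, carbohydrate_g):
    """Return (protein %, fat %, carbohydrate %) of the approximate dry weight."""
    dry_weight = protein_g + fat_g + carbohydrate_g
    return tuple(
        round(100.0 * grams / dry_weight, 1)
        for grams in (protein_g, fat_g, carbohydrate_g)
    )


def percent_water(water_g, serving_g=SERVING_G):
    """Return water as a percent of the total serving weight."""
    return round(100.0 * water_g / serving_g, 1)


# Gram values as listed in Table 1.
cod = percent_dry_weight(19.39, 0.73, 0.0)
peanuts = percent_dry_weight(20.16, 42.27, 18.29)
cod_water = percent_water(64.55)

print("Cod, % dry weight (protein, fat, carbohydrate):", cod)
print("Peanuts, % dry weight (protein, fat, carbohydrate):", peanuts)
print("Cod, % water of total weight:", cod_water)
```

Under this assumption the cod and peanut rows reproduce the tabulated
percentages, which makes the high-protein versus high-lipid contrast drawn in
the text easy to check for any other row.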

What is the allergen concentration or dose that is necessary to stimulate
sensitization to a particular allergen or indeed provoke an allergic reaction?
Unfortunately, there is no direct way this can be determined without
developing relevant animal models. Perhaps such information could be indirectly
obtained by estimating the dose of allergen to which individuals are exposed in
relationship to their allergic responses or IgE antibody production (Table 2).
For this reason, it is very important to express particular protein amounts in
terms of dose rather than percent protein, which, as discussed previously, can
be misleading.
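
The difference between percent of total protein and dose can be made concrete
with a small calculation. In the Python sketch below, the total protein contents
(2, 25, and 50 g per 100 g dry weight) and the 50 g portion are illustrative
assumptions, chosen only so that the patatin example (1 g/100 g dry weight, 50%
of potato protein) and the 2-4% figure for high-protein foods are reproduced;
the function names are hypothetical.

```python
# Sketch: the same allergen dose can correspond to very different
# "percent of total protein" values. All inputs below are illustrative.


def percent_of_total_protein(allergen_g_per_100g, total_protein_g_per_100g):
    """Allergen as a percent of the food's total protein."""
    return 100.0 * allergen_g_per_100g / total_protein_g_per_100g


def allergen_dose_g(portion_dry_g, allergen_g_per_100g):
    """Grams of allergen consumed in a portion of given dry weight."""
    return portion_dry_g * allergen_g_per_100g / 100.0


# Low-protein food: 1 g allergen per 100 g, assumed 2 g total protein per 100 g.
potato_percent = percent_of_total_protein(1.0, 2.0)

# High-protein foods: same 1 g per 100 g, assumed 25 or 50 g protein per 100 g.
high_protein_percents = (
    percent_of_total_protein(1.0, 25.0),
    percent_of_total_protein(1.0, 50.0),
)

# The dose from a hypothetical 50 g (dry weight) portion is the same in each case.
dose = allergen_dose_g(50.0, 1.0)

print(potato_percent, high_protein_percents, dose)
```

Expressed as percent of total protein the two situations look very different,
whereas the amount of allergen actually consumed is identical.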

Cross-reactivity among a number of foods of similar food types as well as 
between foods and seemingly unrelated substances has been demonstrated. 
For many years it has been known that foods within a certain group or family 
can cross-react; examples are members of the legume family, such as
soybeans, peanuts, and peas, and crustaceans, such as lobster, shrimp, and
TABLE 1. Composition of common foods in our diet

Food, 85.05 g (3 oz)                   Total protein   Total fat       Total carbohydrate   Water
                                       g (%)^a         g (%)^a         g (%)^a              g (%)^b

Cod, Atlantic fillet, baked            19.39 (96.4)    0.73 (3.6)      0 (0)                64.55 (75.90)
Tuna, light, canned                    24.52 (96.1)    1.04 (3.9)      0 (0)                53.4 (62.78)
Shrimp, small, mixed species, steamed  17.78 (95.1)    0.92 (4.9)      0 (0)                65.74 (77.30)
Crawfish, mixed species, boiled        14.88 (93.1)    1.11 (6.9)      0 (0)                68.72 (66.63)
Crab, blue, steamed                    17.18 (91.9)    1.51 (8.1)      0 (0)                65.83 (77.40)
Lobster, northern, steamed             17.44 (91.6)    0.50 (2.6)      1.09 (5.7)           64.64 (76.00)
Crab, Dungeness, steamed               18.97 (91.1)    1.05 (5.0)      0.81 (3.9)           62.34 (66.63)
Salmon, fillet, pink, baked            21.77 (85.3)    3.76 (14.7)     0 (0)                59.28 (69.70)
Chicken breast, boneless, roasted      25.34 (79.3)    6.62 (20.7)     0 (0)                53.07 (62.40)
Tuna, canned                           24.75 (78.0)    6.98 (22.0)     0 (0)                50.86 (69.70)
Beef, roast, lean only                 23.97 (76.8)    7.23 (23.2)     0 (0)                52.70 (61.63)
Trout, fillet, mixed species, baked    22.62 (75.9)    7.20 (24.2)     0 (0)                53.92 (63.40)
Pork, fresh whole loin, roasted        23.05 (64.8)    12.50 (35.2)    0 (0)                48.90 (57.50)
Lamb, NZ leg, roasted, 1/8" trim       21.52 (64.4)    11.91 (35.6)    0 (0)                50.18 (59.0)
Egg                                    10.63 (52.7)    8.51 (42.2)     1.04 (5.2)           64.04 (75.3)
Soy beverage, fluid                    2.34 (42.5)     1.62 (29.5)     1.54 (28.0)          79.35 (93.30)
Soy flour, whole                       30.24 (39.0)    30.24 (39.0)    17.01 (22.0)         4.39 (5.16)
Broccoli, boiled                       2.64 (36.2)     0.10 (1.4)      4.55 (62.4)          77.14 (90.70)
Milk, whole, cow's                     3.01 (29.1)     3.06 (29.6)     4.26 (41.2)          80.52 (88.0)
Peanuts, dry roasted                   20.16 (25.0)    42.27 (52.4)    18.29 (22.7)         1.32 (1.55)
Almonds, dry roasted                   18.12 (22.8)    45.25 (56.8)    16.24 (20.4)         1.79 (2.10)
Okra, boiled                           1.59 (20.2)     0.14 (1.8)      6.13 (78.0)          76.45 (89.90)
String beans, canned                   0.98 (16.4)     0.09 (1.5)      3.83 (64.2)          79.35 (93.30)
Wheat grain, durum                     11.65 (15.7)    2.10 (2.8)      60.47 (81.5)         9.27 (10.90)
Brazil nuts, large, dried              12.16 (15.3)    56.30 (71.0)    10.89 (13.7)         2.84 (3.33)
English walnuts, dried                 12.16 (15.1)    52.65 (65.5)    15.56 (19.4)         3.10 (3.69)
Corn, yellow, frozen                   2.34 (12.1)     0.37 (1.9)      16.67 (86.0)         65.23 (76.7)
Apple                                  0.16 (11.9)     0.31 (2.3)      13.1 (96.5)          71.36 (83.90)
Flour, all-purpose, white, enriched    8.79 (11.8)     0.83 (1.1)      64.89 (87.1)         10.12 (11.90)
Eggplant, boiled                       0.71 (10.8)     0.20 (3.0)      5.65 (86.1)          78.08 (91.80)
Carrots, boiled                        0.93 (9.3)      0.15 (1.5)      8.93 (89.2)          74.33 (87.40)
Pecans, dried halves                   6.59 (8.3)      57.49 (72.3)    15.48 (19.5)         4.10 (4.82)
Corn flour, whole grain, white         5.89 (7.9)      3.28 (4.4)      65.40 (87.7)         9.27 (10.90)
Potato flour                           5.87 (7.6)      0.29 (0.4)      70.68 (92.0)         5.55 (6.53)
Orange                                 0.80 (7.3)      0.10 (0.9)      10.04 (91.8)         73.82 (86.80)
Banana                                 0.88 (4.2)      0.41 (1.9)      19.90 (93.9)         63.19 (74.30)
Grapes, Thompson seedless              0.56 (3.5)      0.49 (3.0)      15.14 (93.5)         68.55 (80.60)

^a Percent dry weight.
^b Percent total weight.



crayfish. However, these cross-reactivities may have very different
clinical endpoints. Cross-reactivity among legumes demonstrated by in vitro tests
is not necessarily clinically significant. However, cross-reactivity among
Crustacea can be clinically significant, and certainly allergy to one crustacean
suggests potential, although not certain, reactivity to other Crustacea. Cross-
TABLE 2. Food allergen concentration

Protein                           % Total protein

Egg white allergens
  Ovalbumin (Gal d 2)             54
  Ovomucoid (Gal d 1)             11
  Conalbumin (Gal d 3)            12-13
  Ovomucin                        1.5-3.5
  Lysozyme                        3.4-3.5
Milk allergens
  β-Lactoglobulin                 9
  Casein                          80
  BSA                             1
  α-Lactalbumin                   4
Soybean allergens
  β-Conglycinin (? subunit)       18.5
  Kunitz trypsin inhibitor        2-4
  Soy lectin                      1-2
  β-Conglycinin (α subunit)       18.5
  Glycinin                        51
  Gly m Bd 30K                    2-3
Peanut allergens
  Vicilin (Ara h 1)               1«
  Conglutin family (Ara h 2)
  Peanut lectin                   1.3
Mustard allergens
  Sin a 1                         20
  Bra j 1                         20
Fish allergens
  Parvalbumin (Gad c 1)           0.22-0.44
Shrimp allergens
  Tropomyosin (Pen a 1)           20^c
Potato allergens
  Patatin (Sol t 1)               30-40

^a Crude peanut meal.
^b Crude peanut extract.
^c Crude shrimp extract.



reactivity can also occur between foods and seemingly unrelated substances,
and indeed this has been of greater interest in recent years. Cross-reactivity
has been demonstrated between different pollens, particularly birch pollen, and
a variety of fruits and vegetables, based on investigations reported primarily
in Europe. More recently, with increased interest in latex allergy,
cross-reactivity has also been demonstrated between latex proteins and those
present in fruits and vegetables.^18,19 Indeed, these cross-reactivities of fruits
and/or vegetables with inhaled allergens have clinical significance in that
some patients have developed food allergies subsequent to inhalant exposure.
It has been shown by in vitro assays that Crustacea and mollusks can cross-
FIGURE 2. Common food allergens identified, sequenced, and cloned from 1980
through 2000. The number of food allergens cloned has increased substantially during the
past 20 years, and the results to date suggest that it will continue to increase at a very
rapid pace. As of 2000, 57 food allergens had been cloned.



react, probably due to the muscle protein tropomyosin, a common allergen
present in Crustacea. Whether this cross-reactivity is clinically significant or
not remains to be determined, since many crustacea-allergic individuals can
consume mollusks without reactivity. Indeed, classic IgE-mediated allergic
reactions to mollusks, with several exceptions, have not been well
described.

Since 1980, a number of food allergens have been identified and sequenced
(Fig. 2). Although common structural properties have been sought, no unique
structural feature of food allergens has been identified. Over the years, the
number of food allergens identified and characterized has substantially
increased. To date, the amino acid sequences of at least 57 food allergens have
been determined; based on available technologies, this number will increase
substantially over the next several years. More information on allergen
structure will provide a more accurate allergenicity assessment, particularly
since one approach to characterization of molecules of unknown allergenicity is
based on amino acid sequence similarities.



FOOD ALLERGEN EPITOPES: SHRIMP TROPOMYOSIN 

The major allergen in shrimp and indeed other Crustacea has been shown
to be the muscle protein tropomyosin. Tropomyosin is a major (approx-
FIGURE 3. Coiled-coil structure of the tropomyosin molecule (top, A) and this
structure with IgE-binding regions identified (bottom, B). The tropomyosin molecule
is represented schematically as two chains (red and blue) that are in alpha-helical
formation, each chain wound around the other, demonstrating the typical coiled-coil
structure of tropomyosin. The structure is typical for all tropomyosin molecules.
The same molecule is shown in the bottom portion of the figure (B) with the
IgE-binding regions highlighted in yellow. The two chains of tropomyosin are
identical and parallel. IgE-binding regions appear to exist throughout the entire
molecule.



imately 20% of the dry weight) shrimp protein, is the only major allergen
identified in shrimp, and is thought to be a major allergen in other
invertebrates as well. Tropomyosin molecules consist of two identical
polypeptide chains, each in an alpha-helical formation coiled around the other; this is
FIGURE 4. Individually recognized epitopes of regions 3 and 5 of the major shrimp
allergen, Pen a 1. Epitopes recognized by individual region-reactive sera are
summarized in the figure. It is interesting to note that the six sera reactive with
epitope 3a vary considerably in their reactivity, while all react to a common core of
five amino acids. Variability is seen with region 3b as well. In contrast, the
variability appears to be reduced for individual sera reacting to epitopes 5a and 5b.



referred to as the coiled-coil structure of tropomyosin, as illustrated in
Figure 3. Studies in our laboratory have identified five major IgE-binding
regions present in shrimp tropomyosin (Fig. 3). The IgE-binding epitopes
within these regions have been characterized using synthetic overlapping
peptides of 5-15 amino acids in length with an offset of one to two amino
acids for a particular region. Region-specific antisera were tested for IgE
antibody reactivity, and an epitope was identified as the shortest peptide with
maximal IgE binding to that antiserum. Using these results, eight IgE-binding
epitopes in five main allergenic regions have been detected. An example of
epitopes identified in two major regions, region 3 and region 5, is shown in
Figure 4. Of interest is the fact that there can be significant variability of
individual serum reactivity to epitopes. For example, reactivity to epitope 3a
demonstrates significant wobble in IgE antibody reactivity, with only a common
core region of five amino acid residues being the same. In epitopes 5a and 5b,
the wobble effect is much reduced, and reactivity of individual sera is fairly
FIGURE 5. Combinatorial substitutions in epitopes 5a and 5b. Pen a 1 Epitope 5a:
KEVDRLEDE (251-259) (spot 1); substitution 2 (S for E at position 252), substitution
3 (I for V at position 253), substitution 4 (D for R at position 255), substitution 5 (S for
E and I for V), substitution 6 (S for E and D for R), substitution 7 (I for V and D for R),
substitution 8 (S for E, I for V, and D for R), substitution 13 (T for E at position 252),
substitution 14 (T for E and I for V), substitution 15 (T for E and D for R), and
substitution 16 (T for E, I for V, and D for R). Pen a 1 Epitope 5b: KYKSITDE (266-273)
(spots 1 and 11); substitution 2 (F for S at position 269), substitution 3 (C for T at
position 271), substitution 4 (D for E at position 273), substitution 5 (F for S and C
for T), substitution 6 (F for S and D for E), substitution 7 (C for T and D for E),
substitution 8 (F for S, C for T, and D for E), and substitution 12 (S for T at
position 271).



similar. Perhaps this reflects differences in affinity of IgE antibodies to dif- 
ferent epitopes and raises the question whether IgE antibody reactivity to cer- 
tain epitopes may be more relevant clinically than that to other epitopes on 
the same allergen. 
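
The mapping strategy described here, overlapping peptides followed by selection
of the shortest peptide with maximal IgE binding, can be sketched in a few lines
of Python. This is a minimal illustration and not the laboratory protocol: the
20-residue sequence, the peptide length and offset, and the binding signals are
all hypothetical placeholders, as are the function names.

```python
# Sketch: overlapping-peptide generation and minimal-epitope selection.
# The sequence and binding values are toy data, not Pen a 1 measurements.


def overlapping_peptides(sequence, length, offset, first_position=1):
    """Return (start, end, peptide) tuples for a sliding window over sequence.

    Positions are 1-based and inclusive, counted from first_position.
    """
    peptides = []
    for start in range(0, len(sequence) - length + 1, offset):
        begin = first_position + start
        peptides.append((begin, begin + length - 1, sequence[start:start + length]))
    return peptides


def minimal_epitope(binding, tolerance=0.0):
    """Return the shortest peptide whose signal is within tolerance of the maximum."""
    best = max(binding.values())
    maximal = [peptide for peptide, signal in binding.items() if signal >= best - tolerance]
    return min(maximal, key=len)


TOY_REGION = "ACDEFGHIKLMNPQRSTVWY"  # hypothetical 20-residue region
peptides = overlapping_peptides(TOY_REGION, length=8, offset=2)

# Hypothetical relative IgE-binding signals for nested peptides.
toy_binding = {"CDEFGHIK": 0.9, "DEFGHI": 0.9, "EFGH": 0.4}
epitope = minimal_epitope(toy_binding)

for begin, end, peptide in peptides:
    print(begin, end, peptide)
print("minimal epitope:", epitope)
```

The tolerance parameter is one possible way to represent "maximal" binding when
signals are noisy; with real sera the appropriate cutoff would have to be chosen
empirically.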

To identify amino acid residues in an epitope essential for IgE antibody
binding, mutated peptide epitopes with substituted amino acids based on the
amino acid sequence of homologous regions in nonallergenic tropomyosins
were synthesized and tested for IgE antibody binding with epitope residue
antisera. An example of the IgE antibody binding activity to mutated epitopes 5a
and 5b is shown in Figure 5. The results indicate that a single amino acid
substitution may have no effect, may reduce or abolish IgE binding, or may even
enhance IgE binding. This certainly has significant implications when assessing
IgE antibody reactivity to food allergen epitopes. Generally, more than two
substitutions abolish IgE binding to modified Pen a 1 peptides.
Substitutions in the epitope center are more likely to eliminate IgE binding
(59.5%) than substitutions on the epitope periphery (39.1%). Most
(63.5%) of the single substitutions that eliminate IgE binding are not
conservative (different amino acid groups), as compared to the 23.1% that are
conservative (same amino acid group) substitutions. Whether these observations
for IgE-binding epitopes of tropomyosin are relevant to those of other food
allergens is not clear. Generally, however, our results are consistent with those
reported for epitopes of the major peanut allergens Ara h 1, 2, and 3.
Common patterns of epitope structure and activity observed for food allergens
would provide important information for assessment of the allergenicity of
food proteins.
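
The combinatorial design of the mutated peptides can be illustrated with a short
Python sketch. It assumes the epitope 5a sequence KEVDRLEDE (251-259) and the
three substitutions named for spots 2 to 4 in Figure 5 (S for E at 252, I for V
at 253, D for R at 255); the function names are hypothetical, and the code only
enumerates peptide sequences and says nothing about their IgE binding.

```python
# Sketch: enumerate all combinations of single-residue substitutions in an epitope.
from itertools import combinations


def apply_substitutions(peptide, first_position, substitutions):
    """Apply {position: (old, new)} substitutions; positions follow protein numbering."""
    residues = list(peptide)
    for position, (old, new) in substitutions.items():
        index = position - first_position
        if residues[index] != old:
            raise ValueError(f"expected {old} at position {position}, found {residues[index]}")
        residues[index] = new
    return "".join(residues)


def combinatorial_variants(peptide, first_position, substitutions):
    """Return (positions, variant) for every subset of substitutions, smallest subsets first."""
    positions = sorted(substitutions)
    variants = []
    for size in range(len(positions) + 1):
        for chosen in combinations(positions, size):
            subset = {position: substitutions[position] for position in chosen}
            variants.append((chosen, apply_substitutions(peptide, first_position, subset)))
    return variants


EPITOPE_5A = "KEVDRLEDE"  # assumed sequence of epitope 5a, residues 251-259
SUBSTITUTIONS = {252: ("E", "S"), 253: ("V", "I"), 255: ("R", "D")}

variants = combinatorial_variants(EPITOPE_5A, 251, SUBSTITUTIONS)
for spot, (positions, sequence) in enumerate(variants, start=1):
    print(spot, positions, sequence)
```

Three substitution sites give eight peptides (the unmodified epitope, three
single, three double, and one triple mutant), which is the order in which the
first eight spots are listed in the legend to Figure 5.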



PREDICTING FOOD PROTEIN ALLERGENICITY: 
FOOD FOR THOUGHT 



Based on current knowledge of the properties of known food allergens, can
the allergenicity of a novel protein to which there is no prior human exposure
be precisely predicted? The answer at this time is probably no; however,
reasonable assessments based on our current knowledge of food allergens can be
made. The information available on known food allergens is increasing, and
thus the precision in predicting food protein allergenicity will increase as
well. Several important issues are relevant to any type of predictive model.
First, food allergen concentration in relationship to food sensitization or
provocation of an allergic reaction needs to be determined. Unfortunately,
little, if any, information is available on the sensitizing dose of a food
allergen other than crude estimates of doses in prior exposures; there is more
information, although sparse, on the allergen dose provoking an allergic reaction.
Better estimates of exposure levels to known food allergens that stimulate IgE
antibody production or provoke allergic reactions will aid development of an
indirect measurement of sensitization dose. If sensitizing doses for different
food allergens can be estimated and sensitization does not occur below such
doses, can a level of food allergens be determined below which sensitization
does not occur? Such information, obtained through analysis of the doses of
food allergens to which we are exposed, could be invaluable in assessing
novel protein allergenicity.

Second, a protein's ability to resist degradation by processing and
enzymatic digestion has been used to assess potential allergenic and nonallergenic
food molecules; indeed, it has been a criterion for assessing the potential
allergenicity of novel proteins. Nevertheless, many food proteins that are not
allergenic are stable to processing and enzymatic digestion. Conversely, some
protein molecules have been shown to be allergenic yet are sensitive to
processing and/or enzymatic digestion. Does this mean that stability of
molecules is not important in allergenicity assessment? Probably not, but it does
tell us that this is not an absolute criterion, and one must be careful in using
this information for assessing proteins of unknown allergenicity.

Additional criteria to assess the allergenicity of proteins must be developed.
The sensitizing dose for an allergen mentioned previously may be useful in
estimating the amount of a food allergen component below which the
induction of an allergic response is unlikely. Many high-protein foods such as
shellfish are major food allergens; however, there are other high-protein foods
that contain similar molecules, such as meats like beef or chicken, that are not
very allergenic. These proteins from less allergenic sources are more similar to
molecules expressed in man. Thus, the foreignness (immunogenicity) of
certain proteins has an effect on allergenicity. For example, Crustacea belong to
a different phylum than humans, cattle, or pigs; and thus crustacean
tropomyosins are more foreign and therefore more allergenic in humans compared to
the more closely related beef or pork tropomyosins.

Amino acid sequence similarity is used to compare protein molecules with
known allergens; however, the criteria for comparison have not been
adequately described. What characteristics are important to compare? Are
selected sequence comparisons of epitopes (B or T cell) useful, or are comparisons
of the entire molecule more significant? What length of amino acid sequence
and what percent of identity are necessary for a match? Such criteria are
particularly important since it has recently been shown that small sequence
changes, even in one amino acid, of IgE-binding epitopes in peanut or shrimp
allergens may abolish or even enhance IgE antibody reactivity. Thus,
under current assessment methods, theoretically an amino acid sequence from
a protein of unknown allergenicity may have 90% identity with an allergenic
epitope, yet according to current criteria may not be included as a potential
risk. Alternatively, should amino acid sequence homology of novel proteins
with nonallergens be considered as important? If a protein has a sequence
homology of more than 80% with a known nonallergen, does that fact rate
importance in allergenicity assessment? Should the similarity or dissimilarity to
human proteins (i.e., foreignness) be assessed? All of these issues need further
consideration.
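
To make the question of "percent identity" concrete, the following Python sketch
slides an epitope along a query protein and reports the best ungapped match. It
is an illustration only: the sequences are hypothetical, no gaps or
conservative-substitution scoring are considered, and the function names are not
taken from any established assessment scheme.

```python
# Sketch: best ungapped percent identity between an epitope and a query protein.
# Sequences below are hypothetical placeholders.


def percent_identity(a, b):
    """Percent of identical residues between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def best_window(epitope, protein):
    """Return (identity %, 1-based start, window) for the best-matching window."""
    size = len(epitope)
    best = (0.0, 0, "")
    for start in range(len(protein) - size + 1):
        window = protein[start:start + size]
        identity = percent_identity(epitope, window)
        if identity > best[0]:
            best = (identity, start + 1, window)
    return best


TOY_EPITOPE = "ACDEFGHIKL"      # hypothetical 10-residue IgE-binding epitope
TOY_QUERY = "MMACDEFGHIKVMM"    # hypothetical novel protein, one residue differs

best = best_window(TOY_EPITOPE, TOY_QUERY)
print(best)
```

In this toy case the query differs from the epitope at a single residue, giving
90% identity over ten residues; as noted above, such a single change could
abolish or enhance IgE binding, so the identity figure alone does not settle the
question of risk.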

Finally, development of animal models that have been adequately validated
as surrogates of human IgE antibody responses is required for more precise
assessments of the allergenic potential of proteins. Such models must be
adequately described with regard to the type of exposure, the route of exposure,
the number of exposures, the allergen dose, and the use of adjuvants. My own
prejudice is that an animal model that reflects all aspects of food allergy is
not only unlikely to be developed, but also probably not necessary for
assessment of novel protein allergenicity. However, an animal model of
allergenicity, although reflecting only one aspect of food allergy, would serve to
provide an important basis for testing or assessing the allergenicity of novel
proteins. For validation, animal models must react to allergenic foods as
opposed to nonallergenic foods and recognize protein allergens within the
allergenic foods as opposed to nonallergenic proteins within the same foods. This
can be easily assessed by Western blotting with animal serum compared to
human sera from food-allergic individuals. Finally, epitope reactivity of
animal IgE antibodies must be compared to that of humans. One would expect
that a validated animal model of allergenicity would produce IgE antibodies
that react to epitopes similarly to human IgE antibodies. If all of these criteria
correlate with human allergic responses, such a model will be an invaluable
tool for the assessment of not only novel proteins but any new proteins to
which persons are exposed.

Clearly, our body of knowledge of food allergen structure has increased
substantially during the past 20 years. Allergens are molecules that generally
(although not always) appear to be major protein components within a
particular food and are usually resistant to processing, cooking, and digestion.
Further study of the general properties of foods in relationship to their
allergenicity is encouraged in that this may provide more useful information
on the basis of allergenicity of these molecules. For example, foods of
medium protein concentration that are highly allergenic, such as peanuts and tree
nuts, appear to have higher levels of fat. This might be useful in protecting
allergenic molecules during the digestive process. Furthermore, little is
known about the dose of allergen required to stimulate an immune
response. Little consideration has been given to homology of allergens in
relationship to their foreignness. These are certainly important areas that
warrant further study. The evidence to date suggests that we cannot yet precisely
predict the potential allergenicity of a molecule to which there is no prior
human exposure. However, based on our knowledge of food allergens, it is
possible to make educated estimations about the potential allergenicity that
should exclude any major new allergens from being introduced into our food
supply through genetic modification of foods.



ACKNOWLEDGMENTS 

The authors wish to acknowledge the excellent help in preparation of this
manuscript by Ms. Pat Constant and the information provided on the
composition of commonly consumed foods by Dr. Carol O'Neil at the Department
of Nutrition, Louisiana State University in Baton Rouge, Louisiana. Funds
provided by the National Fisheries Institute and the Department of Medicine,
Tulane University School of Medicine, helped cover the cost of preparation
of this manuscript.



LEHRER etal.: FOOD ALLERGENS 






REFERENCES 

1. Burks, A.W. & H.A. Sampson. 1993. Food allergies in children. Curr. Prob.
   Pediatr. 23: 230-252.

2. Jansen, J.J., A.F. Kardinaal, G.H. Huijbers, et al. 1994. Prevalence of food
   allergy and intolerance in the adult Dutch population. J. Allergy Clin.
   Immunol. 93: 446-456.

3. Sampson, H.A. & D.D. Metcalfe. 1991. Immediate reactions to foods. In
   Food Allergy: Adverse Reactions to Foods and Food Additives. D.D. Metcalfe,
   H.A. Sampson & R.A. Simon, Eds.: 99-112. Blackwell Scientific
   Publications. Oxford.

4. Lehrer, S.B. & G. Reese. 1998. Food allergens: implications for
   biotechnology. In Biotechnology and Safety Assessment, 2nd ed.: 127-149.
   Taylor & Francis. Philadelphia.

5. Taylor, S.L. & S.B. Lehrer. 1996. Principles and characteristics of food
   allergens. Crit. Rev. Food Sci. Nutr. 36(S): S91-S118.

6. Lehrer, S.B., W.E. Horner & G. Reese. 1996. Why are some proteins
   allergenic? Implications for biotechnology. Crit. Rev. Food Sci. Nutr.
   36(6): 553-564.

7. Nelde, A., M. Teufel, C. Hahn, et al. 2001. The impact of the route and
   frequency of antigen exposure on the IgE response in allergy. Intl. Arch.
   Allergy Immunol. 124: 461-469.

8. O'Neil, C.E. & S.B. Lehrer. 1996. Occupational reactions to food allergens.
   In Food Allergy: Adverse Reactions to Foods and Food Additives. D.D.
   Metcalfe, H.A. Sampson & R.A. Simon, Eds.: 311-335. Blackwell Scientific
   Publications. Oxford.

9. Barnett, D., B. Bonham & W.E.H. Howden. 1987. Allergenic cross-reactions
   among legume foods: an in vitro study. J. Allergy Clin. Immunol. 79:
   443-448.

10. Taylor, S.L., R.F. Lemanski, R.K. Bush & W.W. Busse. 1987. Food allergens:
    structure and immunologic properties. Ann. Allergy 59: 93-99.

11. Daul, C.B., J.E. Morgan, N.P. Waring, et al. 1987. Immunological evaluation
    of shrimp-allergic individuals. J. Allergy Clin. Immunol. 80: 716-722.

12. Lehrer, S.B. 1986. The complex nature of food antigens: studies of
    cross-reacting Crustacea allergens. Ann. Allergy 57: 267-272.

13. Bernhisel-Broadbent, J. & H.A. Sampson. 1989. Cross-allergenicity in the
    legume botanical family in children with food hypersensitivity. J. Allergy
    Clin. Immunol. 83: 435-440.

14. Anderson, L.B., E.M. Dreyfuss, J. Logan & D.E. Johnston. 1970. Melon
    and banana sensitivity coincident with ragweed pollinosis. J. Allergy Clin.
    Immunol. 45: 310-319.

15. Enberg, R.N., F.E. Leickley, J. McCoullough, et al. 1987. Watermelon and
    ragweed share allergens. J. Allergy Clin. Immunol. 79: 867-875.

16. Calkhoven, P.G., M. Aalbers, V.L. Koshte, et al. 1987. Cross-reactivity
    among birch pollen, vegetables and fruits as detected by IgE antibodies is due
    to at least three distinct cross-reactive structures. Allergy 42: 382-390.

17. Wüthrich, B. & R. Dietschi. 1985. The celery-carrot-mugwort-condiment
    syndrome: skin test and RAST results. Schweiz. Med. Wochenschr. 115: 258-264.



84 



ANNALS NEW YORK ACADEMY OF SCIENCES 



18. Makinen-Kiujunen, S. 1994. Banana allergy in patients with immediate-type 

hypertsensitivity to natural rubber latex: characterization of cross-reacting 
antibodies and allergens. J. Allergy Clin. Immunol. 93: 990-994. 

19. M'Raimi, L., D. Charpin, A. Pons, et at. 199L Cross-reactivity between latex 

and banana. J. Allergy Clin. Immimol. 87: 129-130. 

20. Lehrer, S.B. & M.L. McCants. 1987. Reactivity of IgE antibodies with Crus- 

tacea and oyster allergens: evidence for common antigenic structures. J. 
Allergy Clin. Immunol. 80: 133-139. 

21 . Leung, RS., W.K. Chow, S. Duffey, et aL 1996. IgE reactivity against a cross- 

reactive allergen in Crustacea and moUusca: evidence for tropomyosin as the 
common allergen. J. Allergy Clin. Immunol. 98(5 Pt. 1): 954-961. 

22. LOPATA, A.L. & C. ZiNN & P.C. Potter. 1997. Characteristics of hypersensi- 

tivity reactions and identification of a unique 49-kD IgE-binding protein 
(Hal-m-1) in abalone {Haiotis midae). J. Allergy Clin. Immunol. 100(5): 
642-648. 

23. Carrillo, T., RR. de Castro, C. Blanco, et aL 1994. Anaphylaxis due to 

limpet ingestion. Ann. Allergy 73: 504-508. 

24. Daul, C.B., M. Slattery, G. Reese & S.B. Lehrer. 1994. Identification of 

the major brown shrimp {Penaeus azteciis) as the muscle protein tropomyo- 
sin. InL Arch. Allergy Clin. Immunol. 105; 49-55. 

25. SUANTI, K.N., B.M. Martin, S. Nagpal, et at. 1993. Identification of tro- 

pomyosin as the major shrimp allergen and characterization of its IgE binding 
epitopes. J. Immunol. 151: 5354-5363. 

26. Leung, P.S.C., K.H. Ciiu, W.K. Chow, et al. 1994. Cloning, expression, and 

primary structure of Metapenaeus ensis tropomyosin, the major heat-stable 
shrimp allergen. J. Allergy Clin. Immunol. 92: 837-845. 

27. Leung, P.S., Y.C. Chen, D.L. Mykles, et al. 1998. Molecular identification of 

the lobster muscle protein tropomyosin as a seafood allergen. Mol. Marine 
Biol. Biotechnol. 7: 12-20. 

28. Leung, P.S.C, Y. Chen, M.R. Gershwin, et aL 1998. Identification and 

molecular characterization of Charyhdis feriatiis tropomyosin, the major crab 
allergen. J. Allergy Clin. Immunol. 102: 847-852. 

29. Santos, A.B.R., K.R. Tobias, V.P.L. Ferriani, et aL 1999. Identification of 

tropomyosin from Periplaneta americana as a major cockroach allergen. J. 
Allergy Clin. Immunol. 103(No. 1, Pt. 2): S122. 

30. Asturias, J.A., M.C. Arilla, N. Gomez-Bayon, et aL 1999. Molecular char- 

acterization of American cockroach tropomyosin (Per a 7), a cross-reactive 
allergen. J. Immunol. 162: 4342-4348. 

31. Aki, T., T. Kodama, a. Fujikawa, et aL 1995. Immunochemical characteriza- 

tion of recombinant and native tropomyosins as a new allergen from the 
house dust mite, Dennatophagoides fannae. J. Allergy Clin. Immunol. 96: 
74-83. 

32. Reese, G., R. Ayuso & S.B. Lehrer. 1999. Tropomyosin: an invertebrate pan- 

allergen? IntL Arch. Allergy Immunol. 119(4): 247-258. 

33. Smillie, L.B. 1979. Structure and functions of tropomyosins from muscle and 

non-muscle sources. Trends Biochem. Sci. 4: 151-155. 

34. Ayuso, R., S.B. Lehrer & G. Reese. 2002. Identification of continuous, aller- 

genic regions of the major shrimp allergen Pen a 1 (tropomyosin). Int. Arch 
Allergy Immunol. 127(1): In press. 



LEHRER etal.i FOOD ALLERGENS 



85 



35. Reesh, G., R. Ayuso, S.M. Leong-Kee, et al. 2001. Epitopes mapping of the 

major shrimp allergen Pen a 1 (tropomyosin) and mutational substitution 
analysis. Submitted. 

36. Ayuso, R., G. Reese & S.B. Lemrer. 2001. Molecular basis of arthropod 

cross-reactivity: cross-reactive epitopes of shrimp (Pen a 1), house dust mite 
(Der p 10) and cockroach (Per a 7) tropomyosins. Submitted. 

37. Burks, A.W. D. Shin, G. Cockrell, et aL 1997. Mapping and mutational 

analysis of the TgE-binding epitopes on Ara h 1, a legume vicilin protein and 
a major allergen in peanut hypersensitivity. Eur. J. Biochcm. 245(s): 334- 
339. 

38. Stanley, J.S., N. King, A.W. Burks, et al. 1997. fdentification and muta- 

tional analysis of the immunodominant IgE binding epitopes of the major 
peanut allergen Ara h 2. Arch. Biochem. Biophys. 342(2): 244-253. 

39. Rabjohn, p., E.M. Helm, J.S. Stanley, et al. 1999. Molecular cloning and 

epitope analysis of the peanut allergen Ara h 3. J. Clin. Invest. 103(4): 535- 
542. 



STIC-ILL

From: Huynh, Phuong N.
Sent: Tuesday, September 21, 2004 10:13 AM
To: STIC-ILL
Subject: RE: 09/494,096

Please deliver:

Ferreira et al. Adv Exp Med Biol 409: 127-135, 1996.
Fasler et al. J. Allergy and Clinical Immunology 101(4 pt 1): 521-30, April 1998.
Burks et al. Eur. J. Biochem. 245: 334-339, April 1997.
Stanley et al. Archives of Biochemistry and Biophysics 342(2): 244-253, June 1997.
Evens et al. Therapeutic Drug Monitoring 15: 514-520, 1993.
Skolnick et al. Trends in Biotech. 18(1): 34-39, Jan 2000.
Colman et al. A structural view of immune recognition by antibodies, pages 33-36, 1994.

Thanks, 
Neon 

Art unit 1644 
Room REM 3D75 
Tel (571)272-0846 




Therapeutic Drug Monitoring

15:514-520 © 1993 Raven Press, Ltd., New York 



Biotechnology: An Introduction to Recombinant DNA 
Technology and Product Availability 

†‡Ronald P. Evens and *Mark Witcher

*Manufacturing Department and †Professional Services Department, Amgen Inc., Thousand Oaks; and ‡School of
Pharmacy, University of Southern California, Los Angeles, California, U.S.A.



Summary: Exciting advances in biotechnology have led to the development of
innovative biological products to improve health care. In this review article,
the major techniques of product development and manufacturing in
biotechnology are discussed, with a focus on recombinant DNA technology. In
addition, monoclonal antibody, nucleotide blockade, polymerase chain reaction,
antisense, and gene therapy technologies will be defined briefly. For
recombinant DNA technology, the issues of gene isolation, gene cloning, protein
expression, scale-up (manufacturing), and quality assurance are addressed.
The 16 approved products and the research pipeline are characterized. In
addition, major usage issues for biological products are noted. This review
serves as an introduction to the science and applications of biotechnology,
present and future. Key Words: Biotechnology; Recombinant DNA technology;
Biological products.



During the past two decades, biotechnology has
evolved from a laboratory science to a dynamic
research-based business and an established form of
therapy that benefits patients every day. The
sophisticated science of biotechnology continues to unfold
and is expanding rapidly, from the core technologies
of recombinant DNA procedures and monoclonal
antibodies to whole new techniques, such as
antisense (anti-RNA code blockade) (1). The
biotechnology revolution is a success because of the
scientific ingenuity and entrepreneurial spirit of
molecular biologists, biochemists, and investment
capitalists, who have formed >1,000 biotechnology
research companies in the United States that develop
biological products to mitigate the effects of
diseases in new ways.

Two indispensable factors in this success have
been substantial research collaboration with
universities and research and regulatory cooperation with
government agencies. A 1991 survey showed that
742 biotechnology firms had billion in revenue
(2). In 1992, the top 79 biotechnology companies,
according to stock market analysts, generated
$2.127 billion in revenue (3), which was corroborated
in a recent review article (13 molecules were
responsible for $2.14 billion in sales in 1992) (4).
The research and development commitment is
impressive as well; 742 companies (1991 survey)
reportedly invested >$3.63 billion in research (~30%
of revenues) (2). From 1990 to the year 2000, the
U.S. biotechnology market is projected to grow
tenfold, including pharmaceutical, diagnostic,
agricultural, and environmental products. This review
article focuses on the scientific principles of
biotechnology, product development issues, and products
currently in use and under development.

DEFINITIONS 

The molecules or biological products created by 
biotechnology research and manufacturing to date
are all proteins. The manufacturing process for
these molecules uses a living system, usually a cell;
this cell produces all of its normal proteins plus one
additional protein, very similar to a naturally
occurring human molecule. Van Noordwijk offered a
broadly applicable, yet simple definition of biotech- 
nology: "the use of tissue cultures, living cells or 
cell enzymes to make a defined product" (5). More 
precise definitions exist and often delineate specific 
processes that make up biotechnology, such as hy- 
bridoma, cell fusion, and recombinant DNA. Fur- 
thermore, biotechnology can be described as a com- 
posite of several key scientific disciplines — 
molecular biology, microbiology, biochemistry, 
immunology, genetics, and engineering. 

In the brief time period from 1982 to 1993, bio- 
logical products have become widely available (15 
molecules and 21 products by the fall of 1993). 
These products are being used in hundreds of thou- 
sands of patients annually for such common mala- 
dies as diabetes, cancer, heart attacks, and anemia. 
Biotherapy is a term now used to characterize the 
clinical use of these products. Its definition encom- 
passes the administration of proteins, which often 
are duplicates or functional equivalents of naturally 
occurring molecules in the human body, in order to 
enhance normal body functions or replace deficient
or dysfunctional proteins. Biotherapy is being inte- 
grated into patient care along with the established 
therapies of drugs, diet, surgery, physical proce- 
dures, radiation, and psychotherapy. 

DESCRIPTION OF RECOMBINANT 
DNA TECHNOLOGY 

Molecular biology is based on the central tenet 
that DNA makes RNA makes protein (6). This pro- 
cess comprises three major steps: transcription, 
translation, and completion of the protein. Genes 
are DNA nucleotide sequences, the basic units of 
heredity, and the starting material for recombinant 
DNA (r-DNA) technology. In the 23 pairs of chro- 
mosomes in the nucleus, made up of three to four 
billion base pairs of nucleotides, we need to locate 
the one gene of 50,000 genes responsible for pro- 
ducing a specific protein. In addition, genes contain 
coding DNA sequences (exons) responsible for protein
production; a leader DNA sequence to initiate
transcription; noncoding DNA material (introns), 
which must be eliminated in the formation of mes- 
senger RNA (m-RNA); and nucleotide sequences 
that indicate the terminal end of a gene and turn off 



transcription. The end product of r-DNA technol- 
ogy is a protein, which includes a set of amino acids 
of the correct number, type, and sequence. Several 
structural requirements are also critical for optimal 
activity of proteins; these include disulfide
cross-linkage, three-dimensional folding, and
glycosylation with sugar moieties of the right number,
location, and type.
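
The flow from gene to protein described above can be illustrated with a small sketch. It is a simplification under stated assumptions: the gene is a toy sequence invented for illustration, exon positions are supplied by hand, and the function names (splice, transcribe, translate) are hypothetical. Only the standard genetic code table is taken as given.

```python
# Toy illustration of "DNA makes RNA makes protein".
# The sequences and function names are hypothetical.

BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def splice(gene, exon_ranges):
    """Keep only the exons (coding segments); introns are dropped."""
    return "".join(gene[start:end] for start, end in exon_ranges)


def transcribe(coding_dna):
    """Return the m-RNA matching the coding strand (T replaced by U)."""
    return coding_dna.upper().replace("T", "U")


def translate(mrna):
    """Read the m-RNA codon by codon until a stop codon is reached."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        codon = mrna[i:i + 3].replace("U", "T")
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical gene: exon "ATGGCC", intron "GTAAGT", exon "TGGTAA".
toy_gene = "ATGGCC" + "GTAAGT" + "TGGTAA"
toy_protein = translate(transcribe(splice(toy_gene, [(0, 6), (12, 18)])))
```

Real genes also carry leader and terminator sequences, and the finished protein still needs folding, disulfide cross-linkage, and glycosylation, none of which this sketch models.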

The steps in the process and the basic elements of 
recombinant DNA technology are outlined in Table 
1 and are described herein. Recombinant DNA 
technology starts with the step of gene isolation, 
which includes three methods. One method begins 
with protein analysis; the composition of the pro- 
tein must be identified fully with regard to the type 
and sequence of amino acids. Because the DNA 
codons (triplets) for each amino acid already are 
known, the DNA sequence then is synthesized in 
reverse from the protein. A second method involves 
isolation of the m-RNA that is responsible for pro- 
ducing a specific protein. Then, the viral enzyme, 
reverse transcriptase, is used to create the comple- 
mentary DNA for the m-RNA. A third method is a 
screening process of the full genomic library when 
the protein cannot be identified fully. From two
separate peptide segments of the protein, two distinct
oligonucleotide probes matching the two peptides
are created. Next, the chromosome is denatured
into several hundreds of thousands of DNA
segments. This pool of DNA segments is searched
using each oligonucleotide probe. A DNA segment
that matches both probes would be a likely clone
representing the target gene.

TABLE 1. Steps in process and basic elements of
recombinant DNA technology

Step 1
  Gene isolation (three alternatives with different starting
  material)
    From protein analysis to DNA triplets to DNA synthesis
    From m-RNA identification plus viral reverse
      transcriptase to c-DNA formation
    From chromosome pool with DNA probes to DNA
      screening for gene
Step 2
  Cloning/expression (from genes to proteins in the
  laboratory)
    Plasmid and gene: starting DNA materials
    Restriction endonuclease enzyme for plasmid opening
    DNA ligase enzyme for plasmid closure with human gene
    Recombinant plasmid entry into host cell
    Host cell/plasmid cloning (duplication)
    Gene expression of protein by host cells
Step 3
  Scale-up process (manufacturing of commercial product)
    Inoculum: daughter cells from master cell bank
    Cell culture: serial fermentation, crude protein mixture
    Purification: desired protein in bulk
    Formulation: final product
Step 4
  Quality control measures (tests of products and process)
    Plasmid/host cell: integrity assessment
    Bulk product: structure, potency, purity, activity
    Process validation: efficiency and reproducibility
    Final product: as for bulk product, plus stability, sterility
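
The two-probe screening idea can be sketched in a few lines. This is a minimal illustration, assuming that exact substring matching stands in for probe hybridization; the pool, the probe sequences, and the function names are all hypothetical.

```python
# Sketch of screening a pool of DNA segments with two oligonucleotide probes.
# Exact substring matching is a stand-in for hybridization (an assumption).


def hybridizes(segment, probe):
    """Return True if the probe sequence occurs within the segment."""
    return probe in segment


def screen_pool(pool, probe_a, probe_b):
    """Return the segments matched by both probes (candidate clones)."""
    return [
        segment
        for segment in pool
        if hybridizes(segment, probe_a) and hybridizes(segment, probe_b)
    ]


# Hypothetical pool of DNA segments and two hypothetical probes.
toy_pool = ["AAGCTTGGCATTCCG", "GGCATTAAAA", "TTTTCCGAAG", "CCGTTTGGCATT"]
candidates = screen_pool(toy_pool, "GGCATT", "TCCG")
```

Requiring both probes to match narrows the pool far more than either probe alone, which is why a segment matching both is treated as the likely clone.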

The second step in product development for
r-DNA technology is cloning and expression, which
is worked out in the laboratory. Cloning is duplication
of a gene and its host cells. Expression is
production of the protein from the gene in a host cell.
The components of this step include the gene,
plasmids, restriction endonucleases, DNA ligases,
DNA linkers, DNA promoters/enhancers, and host
cells.

Plasmids are circular rings of DNA that serve as 
the vector to carry the human gene into host cells. 
Key properties of plasmids are that they are DNA 
material, they have the capacity for autonomous 
replication of DNA, they allow ready acceptance of 
genes, they are transferable between cells, and they 
are stable during the recombinant process. Their 
source is bacteria, where they are responsible for 
transference of virulence or resistance, e.g., the 
pBR322 plasmid. Restriction endonuclease en- 
zymes are employed to cut open plasmids at spe- 
cific nucleotide triplets; a certain enzyme is chosen 
to open the plasmid at sites compatible with the 
terminal triplets of a gene. Bacteria again provide 
this tool, with ~100-200 distinct enzymes available.

DNA ligase enzyme is used to recombine the 
DNA material, incorporating the human gene into 
the bacterial plasmid. If the gene does not permit 
full closure of the plasmid ring, a DNA linker (seg- 
ment of DNA) is added to piece together the gene 
with the plasmid ring. Sometimes a special DNA 
segment from a viral or bacterial source is added to 
the gene/plasmid DNA ring to increase gene
replication and protein production. This recombined
plasmid is then transferred to a host cell. 
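
The cut-and-ligate step can be sketched with plain strings. The sketch assumes a single DNA strand and ignores the complementary strand and its overhang. EcoRI is a real restriction enzyme that recognizes the six-base sequence GAATTC and cuts after the first G, but the plasmid and gene sequences below are invented, and the function names are hypothetical.

```python
# Single-strand toy model of opening a plasmid and ligating in a gene.
# Sequences are hypothetical; only the EcoRI site (G^AATTC) is real.

ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # EcoRI cuts between G and AATTC


def circular_count(circle, site):
    """Count occurrences of site in a circular sequence."""
    wrapped = circle + circle[: len(site) - 1]
    return wrapped.count(site)


def open_plasmid(plasmid, site=ECORI_SITE):
    """Linearize a circular plasmid that carries exactly one site."""
    if circular_count(plasmid, site) != 1:
        raise ValueError("plasmid must contain exactly one restriction site")
    wrapped = plasmid + plasmid[: len(site) - 1]
    cut = (wrapped.index(site) + CUT_OFFSET) % len(plasmid)
    return plasmid[cut:] + plasmid[:cut]


def ligate(linear_plasmid, insert):
    """Join a compatible insert to the opened plasmid and close the ring."""
    for fragment in (linear_plasmid, insert):
        if not (fragment.startswith("AATTC") and fragment.endswith("G")):
            raise ValueError("fragment ends are not EcoRI-compatible")
    return linear_plasmid + insert  # read as a circle


toy_plasmid = "TTGACAGAATTCCGTA"
toy_insert = "AATTC" + "ATGGCCTGG" + "G"  # gene with EcoRI-cut ends
linear = open_plasmid(toy_plasmid)
recombinant = ligate(linear, toy_insert)
```

In this model the recombinant ring carries two EcoRI sites flanking the inserted gene, because each junction regenerates the recognition sequence.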

Host cells need to possess three primary charac- 
teristics: a short reproductive cycle, ease of growth 
in vitro, and acceptability of plasmids. Host cells 
can be fungal (e.g., yeast), bacterial (e.g., E. coli),
or mammalian (e.g., Chinese hamster ovary) cells. 
The host cells with the recombined plasmids are 
placed in special nutrient mixtures designed for op- 
timal cell viability and protein production. These 
cells are cloned (duplicated) along with intracellular 
plasmid multiplication. These host cells retain the 



capacity to produce all the various proteins in their 
normal life cycle and express one more protein, 
which is the duplicate or functional equivalent of 
the human protein produced by human gene expres- 
sion. The cloning process results in a master work- 
ing cell bank of special new parent cells capable of 
producing human proteins. 

The site of manufacture of proteins in r-DNA 
technology is living cells. Scale-up of protein pro- 
duction is required to manufacture sufficient 
amounts of protein from very large cell colonies to 
meet patients' needs in a reasonably cost-effective 
manner. The manufacturing processes primarily are 
twofold, based on the type of host cells, that is, 
mammalian (eucaryote) or microbial (prokaryote) 
cells. In mammalian cell culture, cells grow slowly, 
with a doubling time of ~24-36 h. A production lot
can require more than a month to produce. Promi- 
nent challenges are to maintain an environment that 
remains sterile, keeps cells viable and productive, 
and avoids any contamination over a 1-2 month
period. Mammalian host cells secrete the proteins
extracellularly, which enhances harvesting of proteins
from media. Chinese hamster ovary cells are the 
mammalian cell line employed for epoetin alfa 
(EPOGEN) and alteplase (Activase). For microbial 
cell cultures, the cell doubling time can be as little 
as 30 min, which is highly desirable and, hence, is 
preferred for producing proteins. Microbial fermen- 
tation is an easier process and is more frequently 
used in r-DNA manufacturing. It is employed for 
filgrastim (NEUPOGEN). Microbial cells do not
secrete proteins extracellularly and store proteins in
vesicles, which complicates harvesting of proteins.
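
The practical weight of the doubling times quoted above can be seen with a short calculation. It assumes idealized exponential growth with no lag phase and no nutrient limit, which real cultures do not sustain; the function names are hypothetical.

```python
# Idealized comparison of culture growth for the quoted doubling times.
# Assumes unlimited exponential growth (a simplification).


def doublings(hours, doubling_time_h):
    """Number of population doublings in the given time."""
    return hours / doubling_time_h


def fold_expansion(hours, doubling_time_h):
    """Fold increase in cell number after the given time."""
    return 2 ** doublings(hours, doubling_time_h)


# One day of growth: mammalian cells (24 h doubling) vs. microbial (0.5 h).
mammalian_doublings_per_day = doublings(24, 24)
microbial_doublings_per_day = doublings(24, 0.5)
```

Under these assumptions a mammalian culture doubles at most once per day, while a microbial culture could in principle double 48 times, which is consistent with the month-long production lots described for mammalian cell culture.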

Four primary steps are involved in scale-up for 
mammalian r-DNA manufacturing: inoculum, cell
culture, purification, and formulation. The inocu- 
lum step entails obtaining a single vial of daughter 
cells from the parent cells in the master working cell 
bank. These daughter cells are grown in vitro in 
small spinner flasks to initiate the manufacturing 
scale-up. This initial cell culturing expands cell vol- 
ume with serially larger spinner flasks; cell density 
in the flasks is the critical factor in maximizing cell 
growth. The goal of this inoculum phase is to
prepare a sufficient number of cells for conduct of the
cell culture phase. Cell culture production is the
core process in the scale-up in r-DNA technology: 
the proteins are produced by serial fermentation, in 
which the amount of cloning (number of functioning 
cells, volume of growth media, and protein
production) is increased substantially in a stepwise
fashion. This cell culture process can employ a large
fermenter wherein the cells float free in the me- 
dium, that is, a suspension culture. Alternatively, 
attachment technology, e.g., a fluidized bed fer- 
menter or roller bottles, is used, wherein cells are 
fixed on a surface and are bathed in media. 

The fermentation process requires careful control
of the cell environment, including the nutrient
medium, the air mixture (oxygen, nitrogen, carbon
dioxide, and moisture), the breathability of the
fermentation flasks (between flasks and ambient
environment), the temperature of flasks, and the
removal of cellular waste. Media shifts in roller
bottles are performed periodically to maintain cell
viability. Harvesting of protein from the medium is
done at several times during the cell culture
process, along with a step to concentrate the liquid
mixture to create a crude multi-protein mixture.
Throughout this fermentation period, the cells must
be kept alive and functional without mutation,
change, or contamination (e.g., viruses or
oncogenes).

Purification of the crude protein is accomplished
with a sequence of chromatographic steps to
separate the desired protein from the crude mixture. The
physical properties of the proteins (such as the size
of molecules, the hydrophobic properties, and
electrical charge) allow this separation to be done. The
common chromatographic methods include
gel-filtration, ion-exchange, affinity, reversed-phase,
and hydrophobic techniques. Sterile filtration is
then performed to obtain a purified bulk of target
protein without contaminating material or
organisms. The final step, formulation, requires mixing
the protein with a diluent plus stabilizers and
buffers specific for that protein product. Filling and
labeling of ready-to-use vials is necessary. Some
biological products are lyophilized in an attempt to
improve shelf life, although reconstitution
by practitioners then is required before use.

For microbial fermentation, the cell culture step
uses one or several large fermenters containing the
appropriate medium. Because of the high
cell-growth rate, the fermenter has the capacity to force
air into the media to maintain cell viability. At the
end of fermentation, the cells can be killed without
denaturing the proteins that are stored in
intracellular inclusion bodies. The contents of the
fermenter often are centrifuged to create a cell paste.
Recovery of proteins from this microbial system
usually calls for extraction of the inclusion bodies
from the dead cells. The cells are broken up with a
homogenizer or other high-shear device to obtain
the inclusion bodies. A high-shear device also is
employed to process the inclusion bodies and
remove the bulk protein. The next step is purification
and formulation similar to what we have described
herein.

Quality control is a substantial challenge in the 
biotechnology industry, with >70 tests employed to 
ensure the integrity, quality, and activity of the in- 
termediates as well as the final product. Biological 
products usually are proteins; as such, they are 
more complex compounds, have a more detailed 
and sophisticated manufacturing process, and are 
made in living cells, in contrast to the chemical syn- 
thesis and/or extraction processes used for drugs. 
These three features lead to a potentially greater 
opportunity for contamination, alteration, or dena- 
turation of the protein product. 

Four areas are addressed by quality control, that 
is, plasmids and host cells, bulk product, process 
validation, and final product batches. First, a guar- 
antee of the integrity of host cells and plasmids ne- 
cessitates, for example, chromosomal analysis of 
karyotype, contamination screens for viruses and 
oncogenes, and gene stability assessment. Second, 
testing of bulk product is extensive for a protein 
encompassing, for example, high-pressure liquid 
chromatographic analysis, radioimmunoassay, 
Western and Southern blot analyses, amino acid se- 
quencing, peptide maps, and even bioassay. The 
goals of such a large number of sophisticated tests 
are to ensure structure, purity, potency, and activ- 
ity of the protein. Third, final product is evaluated 
with the same tests as for bulk products, along with 
standard assessments of final formulation (such as 
tests of sterility, excipients, volume, appearance, 
and stability). Fourth, process validation tests are 
intended to address the efficiency and reliability of 
the manufacturing process. 

TECHNIQUES OF BIOTECHNOLOGY 

Although recombinant DNA technology is re- 
sponsible for >90% of the available biological prod- 
ucts, biotechnology has evolved to create several 
key technologies for product development, as rep- 
resented in Table 2. Monoclonal antibody (MAb) 
technology has been in use for the past 20 years and 
has led to many diagnostic advances, e.g., ELISA 
(enzyme-linked immunosorbent assays) test kits for 
drug testing and OncoScint CR103, radiolabeled 
monoclonal antibodies for diagnosis of colorectal 
cancer. Therapeutic applications are limited at this
time to one product, antirejection (kidney) therapy
with muromonab, but many products are in
development, especially for cancer therapy. Manufacture
of a MAb requires creation of a hybridoma from
mouse B cells, which produce specific antibodies
against a target antigen, and from myeloma cells,
which have long lives and produce large amounts of
antibodies (7). The hybridoma possesses properties
of both cell components. It is cloned and grown to
yield antibodies, which are then harvested and
purified.

TABLE 2. Techniques of biotechnology

Recombinant DNA technology
Monoclonal antibody technique
Polymerase chain reaction
Gene therapy
Nucleotide blockade
Peptide technology
Carbohydrate technology

The polymerase chain reaction is a biotechnology
whereby substantial amplification (over several
100,000-fold) of a target nucleic acid sequence (e.g.,
a gene) is obtained (8). This enzymic reaction
happens in repeated cycles of a three-step process: (a)
DNA is denatured to separate the two strands; (b) a
nucleic acid primer is hybridized to each DNA
strand at a specific location in the nucleic acid
sequence; and (c) DNA polymerase enzyme is added
for extension of the primer along the DNA strand to
copy the target nucleic acid sequence. Each cycle
ideally doubles the DNA molecules copied, and the
cycle is repeated until sufficient DNA sequence
material is copied; e.g., 20 cycles with a 90% success
rate yields a roughly 375,000-fold amplification of a
DNA sequence.
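
The amplification figure quoted above follows from simple compounding. As a sketch, assume that in every cycle a fixed fraction of the existing molecules (the "success rate") is copied, so that each cycle multiplies the amount by (1 + success rate); the function name and this constant-efficiency model are assumptions for illustration.

```python
# Fold amplification after repeated PCR cycles, assuming a constant
# per-cycle copying efficiency (an idealized model).


def pcr_amplification(cycles, efficiency):
    """Each cycle multiplies the DNA amount by (1 + efficiency)."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return (1.0 + efficiency) ** cycles


perfect = pcr_amplification(20, 1.0)   # true doubling every cycle
realistic = pcr_amplification(20, 0.9)  # the 90% case from the text
```

With perfect doubling, 20 cycles give 2^20 (about one million) copies per starting molecule; at 90% efficiency the result is 1.9^20, or roughly 3.76 x 10^5, in line with the figure in the text.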

Gene therapy is employed in inheritable diseases
where a protein is either dysfunctional or not
produced because of an abnormal gene (9). These
common and life-threatening diseases include cystic
fibrosis, hemophilia, sickle cell anemia, and diabetes.
The gene responsible for the malady must be
identified first, which was done, for example, for ADA
(adenosine deaminase) enzyme deficiency. This
disease leads to severe combined immunodeficiency
(SCID), often causing death in childhood or
adolescence. The normal human gene for the fully
functional protein is cloned and inserted into a
carrier, such as a harmless virus. A patient's cells,
e.g., T lymphocytes, are grown in the laboratory,
and the cells receive the gene from the viral carrier.
The patient's cells start producing the missing
protein to correct the deficiency. These cells with the
extra functional gene then are returned to the
patient, and the normal protein is produced and
released, alleviating the disease. For SCID and other
diseases, gene therapy is life-saving.

Nucleotide blockade or antisense focuses on ar- 
resting expression of dysfunctional messenger RNA 
or DNA (10). A complementary m-RNA is created 
to match an abnormal m-RNA. The two m-RNA 
strands complex together, preventing translation of 
the m-RNA to form disease-producing proteins. 
Anti-DNA strands also can be created to complex 
with DNA to form a triple helix. Oligonucleotides, 
or short single strands of nucleic acids, instead of 
the full m-RNA also can be employed to block RNA 
expression. Viral disease (herpes simplex and HIV) 
and cancer (oncogenes) are two targets of this form 
of biotechnology. 
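
The base-pairing logic behind antisense can be shown with a minimal sketch. It assumes strict Watson-Crick pairing (A with U, G with C) and treats an exact complementary match as binding; the sequences and function names are hypothetical.

```python
# Sketch of designing an antisense strand against an m-RNA target.
# Strict A-U / G-C pairing is assumed; sequences are hypothetical.

RNA_PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense(mrna):
    """Return the reverse complement of an m-RNA sequence."""
    return "".join(RNA_PAIR[base] for base in reversed(mrna.upper()))


def can_duplex(target_mrna, oligo):
    """True if the oligonucleotide is complementary to part of the target."""
    return antisense(oligo) in target_mrna.upper()


toy_target = "GGAUGGCCUU"
toy_oligo = antisense("AUGGCC")  # designed against an internal region
```

A short oligonucleotide designed this way pairs with only one stretch of the target, which reflects the text's point that oligonucleotides can be used instead of the full complementary m-RNA.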

Peptide technology entails screening for polypep- 
tide molecules that can mimic large proteins, thus 
affording more simple products that may be more 
stable and easier to produce. The peptides can 
serve as protein receptor agonists or antagonists. 
Carbohydrate chemistry is targeted at inflammatory 
and immune diseases involving leukocyte activa- 
tion. Expression of carbohydrate molecules from 
cell surfaces (cell adhesion molecules) can be pre- 
vented, arresting cell migration or activation and, in 
turn, moderating the leukocyte-related components 
of inflammation. 

These brief descriptions of biotechnology
techniques now employed in product development
expose the vast potential for many and varied
biological products, which will modify or correct many
diseases. The biological products now in
development in clinical trials well exceed 100 molecules.

BIOLOGICAL PRODUCT AVAILABILITY 

Fifteen biological molecules have been developed
and marketed in the years from 1982 through
mid-1993. Fourteen companies are marketing 20
products. Table 3 lists the molecules, products,
companies, indications, and year of availability. Some
products are used for single indications [such as
insulin for diabetes (1982) or Factor VIII for
hemophilia (1992)]. However, many products are
approved first for one major indication and then, after
substantial clinical research into additional
indications, are approved for other uses at a later time.
Examples include interferon α-2b (Intron A) for
hairy cell leukemia (1986), AIDS-related Kaposi's
sarcoma (1988), hepatitis C (1991), and hepatitis B
(1992), and epoetin α (EPOGEN, Procrit) for
anemia stemming from chronic renal disease (1989),
from AIDS-related antiviral therapy (1991), and
from cancer chemotherapy (1993). The disease
entities treated by these 15 molecules are
commonplace and address most organ systems and 18
separate conditions. They can be used in
endocrinology (diabetes, growth), hepatitis (types B and C),
oncology (hairy cell leukemia, AIDS-related
Kaposi's sarcoma, renal cell carcinoma, neutropenia,
and infections from chemotherapy), anemias,
cardiology (heart attacks, pulmonary embolism,
hemophilia), neurology (multiple sclerosis), and organ
(kidney) rejection. This brief description establishes
the profound impact of biotechnology on health
care, bringing therapeutic agents to treat diseases
that often were previously untreatable. A new arm
of therapy, biotherapy, has been created.

TABLE 3. Biological products (a)

Generic name             Trade name                          Therapeutic area

Human insulin            Humulin (Lilly)                     Insulin-dependent diabetes mellitus (1982)
Human growth hormone     Protropin (Genentech)               Human growth hormone deficiency in children (1985)
                         Humatrope (Lilly)
Hepatitis B vaccine      Engerix-B (SmithKline Beecham)      Hepatitis B prophylaxis (1986)
                         Recombivax HB (MSD)
Interferon α-2a          Roferon A (Hoffmann-La Roche)       Hairy cell leukemia (1986);
                                                             AIDS-related Kaposi's sarcoma (1988)
Interferon α-2b          Intron A (Schering-Plough)          Hairy cell leukemia (1986);
                                                             AIDS-related Kaposi's sarcoma (1988);
                                                             chronic hepatitis types B (1992) and C (non-A, non-B) (1991);
                                                             condylomata acuminata (1988)
Muromonab-CD3            Orthoclone OKT 3 (Ortho Biotech)    Acute allograft rejection in renal transplant patients (1986)
Alteplase                Activase (Genentech)                Acute myocardial infarction (1987);
                                                             pulmonary embolism (1990)
Epoetin α                Epogen (Amgen)                      Certain anemias: chronic renal disease (1989);
                         Procrit (Ortho Biotech)             AIDS (1991); cancer chemotherapy (1993)
Interferon α-n3          Alferon N (Interferon Sciences)     Condylomata acuminata (1989)
Interferon γ-1b          Actimmune (Genentech)               Chronic granulomatous disease (1990)
Filgrastim (G-CSF)       Neupogen (Amgen)                    Febrile neutropenia (infection) due to
                                                             myelosuppressive chemotherapy (1991)
Sargramostim (GM-CSF)    Leukine (Immunex)                   Myeloid reconstitution after bone marrow
                         Prokine (Hoechst-Roussel)           transplantation (1991)
Aldesleukin              Proleukin (Chiron)                  Metastatic renal cell carcinoma (1992)
Factor VIII              KoGenate (Miles)                    Hemophilia A (1992)
                         Recombinate (Baxter)
Interferon β             Betaseron (Chiron, Berlex)          Multiple sclerosis (1993)

G-CSF, granulocyte colony-stimulating factor; GM-CSF, granulocyte-macrophage
colony-stimulating factor.
(a) List of products is given in order of date of first product approval.



FUTURE BIOTECHNOLOGY PRODUCTS 

The first decade of biotechnology drug development produced an
admirable record of one new biological molecule per year, on average.
This early success is expected to be followed by a host of products in
similar and new therapeutic areas. According to 1991 Pharmaceutical
Manufacturers Association (PMA) data, 132 biological products were in
clinical development and 21 molecules were at the Food and Drug
Administration (FDA) for regulatory review and potential approval
(11). The time frame to bring such new biological products to market
(patient availability) has been documented to average ~4.8 years in
the United States and 3 years in Europe; two-thirds of this time is
consumed by clinical research and one-third by regulatory review (12).
According to PMA data, from 1982 to 1992, FDA review of product
license applications for biological products averaged 24 months, and
new indications took 30 months for review and approval. Over the past
5 years, the research pipeline has grown from <80 to >130 products
(60% growth), and the expected time required for product research and
approval is ~5 years.
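The figures quoted in this paragraph can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function and constant names are hypothetical, and the pipeline endpoints of 80 and 130 are taken as the stated bounds ("<80" and ">130"), so the computed growth is approximate.

```python
# Figures as quoted in the text (references 11 and 12); names are illustrative only.
US_YEARS_TO_MARKET = 4.8   # average time to market, United States
EU_YEARS_TO_MARKET = 3.0   # average time to market, Europe
CLINICAL_SHARE = 2 / 3     # "two-thirds of this time is consumed by clinical research"


def split_timeline(total_years):
    """Split a time-to-market figure into (clinical, regulatory review) years."""
    clinical = total_years * CLINICAL_SHARE
    return clinical, total_years - clinical


def percent_growth(start, end):
    """Relative growth from start to end, in percent."""
    return (end - start) / start * 100


us_clinical, us_review = split_timeline(US_YEARS_TO_MARKET)
eu_clinical, eu_review = split_timeline(EU_YEARS_TO_MARKET)
pipeline_growth = percent_growth(80, 130)  # bounds stated as <80 and >130

print(f"US: {us_clinical:.1f} y clinical, {us_review:.1f} y review")
print(f"EU: {eu_clinical:.1f} y clinical, {eu_review:.1f} y review")
print(f"Pipeline growth: {pipeline_growth:.1f}%")
```

On these assumptions the United States figure splits into about 3.2 years of clinical research and 1.6 years of review, and the pipeline growth comes out slightly above the rounded 60% given in the text.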

These new biotherapies incorporate both extensions of current product
categories and whole new entities. Table 4 cites 10 product categories
for future biological products and includes >150 new products or
indications that are in development (13). Most therapeutic areas are
being targeted by biotechnology, but oncology has the largest number
of products; some two-thirds to three-quarters of interferons,
interleukins, and monoclonal antibodies are used in this area. In
vaccines, AIDS is the major focus. Tissue-growth factors, including
epidermal growth factor, fibroblast growth factor, and
platelet-derived growth factor, address repair of wounds due to
injury, surgery, chronic disease, or decubiti. Clotting factors
include proteins to improve clotting and to dissolve clots. The last
category (labeled "Various") includes such molecules as dismutases,
tumor necrosis factor, and soluble CD4.



TABLE 4. Future biotherapies 




Category                   Number of indications and products

Antisense                    5
Clotting factors             8
Hematopoietic factors       12
Hormones                     7
Interferons                 16
Interleukins                13
Monoclonal antibodies       58
Tissue growth factors       11
Vaccines                    18
Various                    >15



See Burrill and Lee for summary (13). 
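As a quick consistency check, the category counts in Table 4 can be summed and compared with the ">150" total quoted in the text. This is a minimal sketch; the variable names are hypothetical, and because "Various" is listed as ">15", the value 15 is used as a lower bound.

```python
# Counts transcribed from Table 4; "Various" is given as ">15", so 15 is a lower bound.
TABLE_4 = {
    "Antisense": 5,
    "Clotting factors": 8,
    "Hematopoietic factors": 12,
    "Hormones": 7,
    "Interferons": 16,
    "Interleukins": 13,
    "Monoclonal antibodies": 58,
    "Tissue growth factors": 11,
    "Vaccines": 18,
    "Various": 15,
}

total_lower_bound = sum(TABLE_4.values())
largest_category = max(TABLE_4, key=TABLE_4.get)
largest_share = TABLE_4[largest_category] / total_lower_bound

print(f"At least {total_lower_bound} indications and products")
print(f"Largest category: {largest_category} ({largest_share:.0%} of the total)")
```

The lower-bound total is consistent with the ">150 new products or indications" stated in the text, and monoclonal antibodies account for roughly a third of it.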



Biological products have become the mainstay of therapy for major
diseases already benefiting millions of patients each year, with only
15 molecules and 20 products in use to date. Future products and
indications exceed 100 in number. The full impact of these therapies
and the benefits to patients still are being discovered. Biotherapy
holds the promise of a natural therapy, wherein proteins, nucleotides,
and other human molecules are recreated and employed to ameliorate or
even cure disease. We are on the threshold of a pharmacologic and
therapeutic revolution.

REFERENCES 

1. Drews J. Into the 21st century: biotechnology and the
pharmaceutical industry in the next ten years. Bio/technology
1993;11:S16-S20.

2. Dibner MD. Tracking trends in the U.S. biotechnology.
Bio/technology 1991;9:1334-7.

3. Miller L, Escrow J, Malloy M, eds. Biotechnology industry
evaluation: 1992 sector analysis. New York: Paine Webber Inc., Health
Care Group, 1992.

4. Klausner A. Back to the future: biotech product sales 1983-1993.
Bio/technology 1993;11:S35-6.

5. Van Noordwyk J. Quality assurance of products manufactured by
recombinant DNA technology: introduction and elements of a philosophy.
Arzneimittelforschung 1988;33:943-7.

6. Sattelle DB, ed. Biotechnology in perspective. London: Hobsons
Publishing, 1990.

7. Pirofski L, Casadevall A, Rodriguez L, Zuckier LS, Scharff MD.
Current state of hybridoma technology. J Clin Immunol 1990;10(suppl.
6):5S-14S.

8. Lipner M. The polymerase chain reaction: amplifying its role in
research and beyond. Oncol Times 1992; 14-5.

9. Goldspiel BR, Green L, Calis KA. Human gene therapy. Clin Pharmacy
1993;12:488-505.

10. Nagel KM, Holstad SG, Isenberg KE. Oligonucleotide
pharmacotherapy: an antigene strategy. Pharmacotherapy 1993;13:177-88.

11. In development: biotechnology medicines. Washington, D.C.:
Pharmaceutical Manufacturers Association, 1991.

12. Bienz-Tadmor B. Biopharmaceuticals go to market: patterns of
worldwide development. Bio/technology 1993;11:168-72.

13. Burrill GS, Lee Jr KB. Biotech '92: promise to reality. An
industry annual report. San Francisco: Ernst and Young, 1991.



Ther Drug Monit, Vol 15, No. 6, 1993 



STIC-ILL

From: Huynh, Phuong N.

Sent: Tuesday, September 21, 2004 10:13 AM

To: STIC-ILL

Subject: RE: 09/494,096

Please deliver:

Ferreira et al, Adv Exp Med Biol 409: 127-135, 1996.
Fasler et al, J. Allergy and Clinical Immunology 101(4 pt 1): 521-30, April 1998.
Burks et al, Eur. J. Biochem. 245: 334-339, April 1997.
Stanley et al, Archives of Biochemistry and Biophysics 342(2): 244-253, June 1997.
Evens et al, Therapeutic Drug Monitoring 15: 514-520, 1993.
Skolnick et al, Trends in Biotech. 18(1): 34-39, Jan 2000.
Colman et al, A structural view of immune recognition by antibodies, pages 33-36, 1994.



Thanks, 
Neon 

Art unit 1644 
Room REM 3D75 
Tel (571)272-0846 







A STRUCTURAL VIEW OF IMMUNE RECOGNITION BY ANTIBODIES 33 






Effects of amino acid sequence changes
on antibody-antigen interactions

Amino acid substitutions in antibody-antigen interfaces play an
important role in affinity maturation of antibody responses and in
antigenic variation. Structural studies show that some
non-conservative changes are tolerated in these interfaces. By analogy
with observations of amino acid exchangeability within homologous or
mutated protein structures, this suggests that the number of different
antibody specificities is less than the total number of antibodies
which can be encoded by the various genetic mechanisms. On the other
hand, conformational adaptability of antibody structures during
binding of antigen suggests that one antibody can serve to bind a
number of different antigens.



Single amino acid sequence changes within the interface of an
antibody-antigen complex are important in two biological contexts.
Firstly, such changes in the antibody have the capacity to drive the
affinity towards more tightly bound complexes. Secondly, such changes
in the antigen can effectively abolish the interaction entirely,
providing an effective mechanism for antigenic variation. Although
antibody and antigen display biologically asymmetric behaviour in
regard to the effects of single amino acid substitutions, in principle
and in practice, substitutions in either partner can raise or lower
the affinity. Tolerance of amino acid sequence substitutions within
antibody-antigen interfaces suggests that they be considered examples
either of protein-protein interactions generally or of the core of
folded protein structures. In those cases, degeneracy of amino acid
sequence information is well documented among families of homologous
proteins, and it suggests that the number of different antibody
specificities is less than the estimated number of different antibody
molecules.



Antigenic variation 

Substitutions which result in loss of binding are not amenable to
direct study of complexes, although several studies have sought to
rationalize the effects of the new amino acid in the context of the
complex involving the "parent" amino acid.

Escape mutants of viral antigens, selected by growth of virus in the
presence of monoclonal antibody, provide many examples of the type of
substitution which can render the antigen unrecognizable by the
selecting antibody (see, for example, Webster et al., 1987). There are
no discernible trends in these data which suggest preference for
replacement of residues in one physico-chemical class by those in
another as a preferred means of abolishing binding (Colman, 1992).
Studies of the three-dimensional structures of a number of these
antibody-selected mutants show that local changes, at and immediately
adjacent to the site of the mutated residue, suffice to disrupt the
interaction with the selecting antibody (Knossow et al., 1984;
Varghese et al., 1988; Tulip et al., 1991).

Tulip (1990) has observed that a variety of escape mutants of the
influenza virus neuraminidase, selected by the anti-neuraminidase
antibody NC41, result in shape changes of the antigen which could
interfere with surface docking if the remainder of the antigen and the
antibody were rigid and unable to relax around the mutated residue. In
another case, a hydrogen bond is potentially lost by substitution to a
smaller residue (Tulip, 1990). Bhat et al. (1990) report that
substitution of Gln by His in lysozyme also causes shape changes which
cannot be accommodated by the D1.3 anti-lysozyme antibody, and that,
in addition, a hydrogen bond would be lost from the interaction. In
both of these examples, the key to the failure of the mutant to bind
may reside as much in the inflexibility of the surrounding structures
as it does in the physico-chemical environment within the interface of
the mutated residue.

Structural data on the effects of point changes in the binding site
are available in only one system at present, the influenza virus
neuraminidase (Tulip et al., 1992). Two mutants of the N9 subtype
neuraminidase, selected with antibodies other than NC41, involve
substitutions within the binding site on N9 for NC41. In each case,
the affinity is reduced by one to two orders of magnitude (Webster et
al., 1987; Gruen et al., 1993). One of these mutants, Asn 329 to Asp,
is located near the edge of the binding site for the NC41 antibody and
is accommodated by re-positioning the side chain of residue 329
towards the solvent-exposed perimeter of the antibody-antigen
interface. The other, Ile 368 to Arg, results in a complex pattern of
concerted movements around the mutation site to accommodate the
arginyl residue. These changes include a shift by 3 Å in the position
of the arginine from its location in the uncomplexed neuraminidase
structure and a shift by more than 1 Å of a histidine on the antibody.
In both of these structures, the effects of the mutations within the
interface are reminiscent of the effects of amino acid substitutions
within the interior of protein molecules (Matthews, 1991; Anderson et
al., 1993), where localized structural rearrangements around the
mutation site are frequently observed. Analysis of structures in the
Brookhaven data base (Bernstein et al., 1977) suggests a somewhat
looser packing density of atoms within an antibody-antigen interface
compared to other protein-protein interfaces or the interior of
protein molecules (Tulip et al., 1992; Lawrence and Colman, 1993).
This suggests, in turn, that antibody-antigen interfaces should not be
more sensitive to amino acid substitutions than other protein-protein
interfaces.



Somatic mutation 

Point mutations accumulating within the varia- 
ble domains of antibody heavy and light chains are 
associated with increasing affinity of the antibody for 
antigen. 

Alzari et al. (1990) have demonstrated that when 2-phenyloxazolone
binds to an antibody, it makes direct contact with amino acids of the
domain which are known to be frequently mutated in antibodies with
increased affinity for the antigen. In contrast, in this same system,
somatic mutation in the domain in no case maps to amino acids directly
involved in binding the hapten.

Engineered substitutions in an anti-p-azophenylarsonate antibody
(Sharon, 1990) have demonstrated that three of nineteen somatic
changes observed in the VH domain suffice to increase the affinity by
a factor of 200. Comparison with the three-dimensional structure of
the antibody (Rose et al., 1990) suggests that, in every case, the
effect must be indirect, since none of the three substituted residues
is likely to be in direct contact with the antigen. In this example,
there is not yet a report of the antibody-antigen complex structure,
nor is the antibody structure highly refined, so the conclusion should
be considered preliminary.

Structural context and repertoire size

The above examples paint a confusing picture of the specificity of
antibody-antigen interaction. In one structural context, a very
conservative substitution may abolish binding; in another, a
non-conservative substitution may have very little effect on the
binding affinity. How should these observations be treated if one is
to estimate the effective size of the antibody repertoire from genetic
sources alone? One approach is to treat the antibody-antigen interface
the same way as the interior of a protein structure, and to bring to
bear on this problem current knowledge about the degeneracy of
"structural information" among the twenty amino acids.

Current estimates of the potential number of an- 
tibody molecules that can be generated by all the 
known genetic mechanisms is in excess of 10 
(Hunkapiller and Hood, 1989). This and similar other 
estimates assume each of the 20 amino acids is differ- 
ent from every other amino acid, which is appropri- 
ate for purposes of enumeration but not for the 
purpose of estimating how many different antibody 
specificities can be produced by an animal. 

Data from exchangeability matrices (Dayhoff et al., 1978) or from
degeneracy of information in amino acid sequences resulting from
overlapping genes (Sander and Schulz, 1979) suggest that, for
structural purposes, or more precisely for folding purposes, there are
effectively only four or five "different" amino acids. Can the same
reduction be applied to binding interactions between proteins? Folding
could be considered to be more tolerant of amino acid substitutions
because of the cooperative effects caused by the interactants being
covalently linked to each other. Binding interactions could be
considered less tolerant because the changes involved occur in what
might be called the active site. Outside of the antibody-antigen
system, mapping of protein-protein interacting surfaces by mutational
analysis is also generally successful (Bowie et al., 1990), but some
unexpected and unexplained findings do occur (de Voss et al., 1992),
suggesting a measure of tolerance of amino acid substitutions
generally in protein-protein interfaces.

These arguments affect considerations of anti- 
body repertoire sizes. In the facile extreme of sub- 
stituting five (types of amino acids) for twenty in the 
calculations of Hunkapiller and Hood (1989), the ex- 
pected number of different antibody specificities is 
reduced to order 10*. Another way to estimate the 
number of different specificities is to argue from the 
physical size of a typical binding site on antibody for 
antigen. For protein antigens, this surface size is of 
the order of 15 amino acids. The numbers of different specificities
that can be encoded over such a surface based on twenty or five
structurally different amino acids are order 10^19 and 10^10,
respectively, if one ignores the important influences of overall
surface shape and partiality of solvent exposure of amino acids within
the surface. Similar arguments and estimates of repertoire size are
applicable to T-cell receptors.
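The order-of-magnitude estimate for a binding surface of about 15 residues follows from counting sequences over an alphabet of either twenty or five structurally distinct amino acids. The short sketch below reproduces that count; the function names are illustrative, and, as the text notes, the calculation ignores surface shape and partial solvent exposure.

```python
import math

SURFACE_RESIDUES = 15  # approximate size of an antibody binding site for a protein antigen


def specificities(alphabet_size, positions=SURFACE_RESIDUES):
    """Number of distinct sequences over `positions` sites with `alphabet_size` residue types."""
    return alphabet_size ** positions


def order_of_magnitude(n):
    """Largest integer k such that 10**k <= n, for the values used here."""
    return math.floor(math.log10(n))


full_alphabet = specificities(20)    # all twenty amino acids treated as distinct
reduced_alphabet = specificities(5)  # five structurally "different" classes

print(f"20^15 = {full_alphabet:.2e} (order 10^{order_of_magnitude(full_alphabet)})")
print(f" 5^15 = {reduced_alphabet:.2e} (order 10^{order_of_magnitude(reduced_alphabet)})")
```

Reducing the alphabet from twenty to five types therefore lowers the estimate by about nine orders of magnitude.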

Some compromise in the capacity of the immune system to cope with
"foreign" structures is implied by these degeneracy arguments. The
above estimates have carried the argument to the extreme and could
therefore be viewed as an extreme lower limit of repertoire size.
Nevertheless, genetic sources of diversity are only part of the story.
Antibodies, as proteins, display the usual types of conformational
adaptability in binding to ligands, as do other proteins, i.e. side
chain rearrangements and main chain changes within loop structures
(Colman, 1988; Wilson and Stanfield, 1993). In addition, there is
growing experimental evidence for the functioning of the VH-VL
interface as a structural adaptor allowing movements of the heavy
chain CDR en masse with respect to the light chain CDR during
engagement with antigen (Colman, 1988, 1991; Herron et al., 1991; Bhat
et al., 1990; Stanfield et al., 1993). These changes in antibody
structure are believed to be specific to the interacting antigen.
Different antigens may therefore induce different structural responses
in the same antibody, adding a structural dimension to diversity
(Colman, 1988).

Recognition of carbohydrates by antibodies

M. Cygler 

Biotechnology Research Institute, National Research Council, 
6100 Royalmount, Montreal, Quebec H4P 2R2 (Canada)



The recognition by antibodies of complex polysaccharides forming the
outer layer of cell walls is an integral part of the immune response
to bacterial infection and invasion by non-self cells. For these
reasons, the interactions of antibodies with sugars have been
extensively investigated by physicochemical and immunochemical methods
(Young et al., 1983; Bundle, 1989; Glaudemans, 1991; Sigurskjold and
Bundle, 1992). The conformation of free (Bush, 1992) and
antibody-bound oligosaccharides has been studied by NMR spectroscopy
(Glaudemans, 1991; Bundle et al., submitted) and crystallography
(Cygler et al., 1991; Vyas et al., 1993). NMR measurements showed that
in some cases, binding to an antibody

Antagonistic peptides specifically inhibit
proliferation, cytokine production, CD40L
expression, and help for IgE synthesis by Der p
1-specific human T-cell clones



Stephan Fasler, PhD, Gregorio Aversa, PhD, Jan E. de Vries, PhD and 
Hans Yssel, PhD Palo Alto, Calif. 



Background: Allergic disorders are characterized by IgE antibody
responses to a multitude of allergens as a result of the ability of
these antibodies to specifically bind to high-affinity IgE receptors
on mast cells and basophils. This interaction results in receptor
activation and release of soluble mediators such as histamine and
leukotrienes, which cause allergic reactions in various target organs.
Because the synthesis of IgE is tightly regulated by cytokines and
CD40 ligand (L) interactions, CD4+ helper T cells are obvious targets,
with the aim to modulate allergen-induced IgE responses.

Objectives: Because of the central role of allergen-specific T-helper
type 2 (Th2) cells in the pathway leading to IgE synthesis in vitro
and in vivo, we have evaluated the possibility of inhibiting
allergen-induced activation of these cells by using allergen-derived
peptides that have been modified by single amino acid substitutions.

Methods: Three cloned human Th2-like CD4+ T-cell lines specific for
Der p 1, the major allergen in house dust, were used in this study.
Upon activation with Der p 1 or specific Der p 1-derived wild-type
peptides, these T-cell clones produce high levels of IL-4 and IL-5 and
low levels of interferon-γ and IL-2, respectively, and furthermore
give help to B cells for the production of IgE in vitro. Modified
synthetic peptides were generated by the introduction of single amino
acid substitutions into two different T-cell activation-inducing
epitopes on Der p 1. The effects of these modified peptides were
studied in Der p 1-induced proliferation, cytokine production, and in
vitro IgE production assays.

Results: Several substituted Der p 1-derived peptides failed to induce
T-cell proliferation, in contrast to the native peptides. In addition,
some of these peptides acted as antagonists by strongly inhibiting
wild-type peptide-induced proliferation as well as the production of
interferon-γ, IL-2, IL-4, and IL-5, although the production of the
latter two cytokines was less affected than that of interferon-γ, even
at a 100-fold molar excess of the antagonistic peptides. In addition,
the presence of an excess of each of the antagonistic peptides during
the activation of Der p 1-specific T-cell clones prevented induction
of CD40L expression, resulting in a failure of these cells to give
help to B cells for the production of IgE in vitro, even in the
presence of exogenous IL-4.

Conclusions: Substitution of certain amino acid residues in
immunogenic Der p 1-derived peptides results in the generation of
peptides that fail to induce proliferation of Der p 1-specific T-cell
clones. In addition, these modified peptides have strong antagonistic
activities on Der p 1-induced proliferation, cytokine production, and
CD40L expression by allergen-specific T-cell clones as well as on T
cell-mediated IgE production by B cells. These findings suggest that
modified peptides interfere with allergen-induced activation of T
cells, including the production of cytokines and the expression of
surface molecules important for successful T cell-B cell interactions,
and may therefore have therapeutic potential by inhibiting the
expansion and function of allergen-specific Th2 cells. (J Allergy Clin
Immunol 1998;101:521-30.)

Key words: Der p 1, allergen-specific T lymphocytes, allergen-derived
peptides, cytokine production, CD40L, IgE synthesis

Allergic disorders are characterized by the presence of antibodies of
the IgE isotype, which in house dust mite-induced atopy are mainly
directed to the Dermatophagoides pteronyssinus group 1 (Der p 1) and
group 2 (Der p 2) allergens. The synthesis of IgE by B cells in vitro
is specifically induced by IL-4 and IL-13, together with a T cell-B
cell contact-mediated signal, accounted for by the interaction of CD40
and CD40 ligand (L). The pivotal role of CD40L in induction of IgE
isotype switching and IgE synthesis has been demonstrated in patients
with hyper-IgM syndrome who have mutations in their genes encoding
CD40L and, as a result, have no or minimal levels of IgG, IgA, and IgE
in their circulation. Although CD40L is expressed on all activated T
cells, CD4+ T-helper type 2 (Th2) cells are considered to play a
pivotal role in house dust mite-induced atopy because of their
capacity to produce high levels of IL-4 and IL-13. Therefore Der p
1-specific Th2 cells are obvious targets for immune intervention in
house dust mite-related atopic diseases, and prevention of activation
and subsequent cytokine production by these cells may be beneficial to
the clinical status of atopic patients.

Allergen-specific CD4+ T cells recognize peptide antigens, presented
by antigen-presenting cells (APCs) in the context of a major
histocompatibility complex (MHC) class II molecule. On the basis of
crystallographic studies, it has been demonstrated that all but one of
the amino acid side-chains within the peptide-binding groove of the
MHC class II protein can interact with the T-cell receptor (TCR) upon
contact between the CD4+ T cell and the APC. Depending on the
positions of replaced amino acids, however, the change will affect the
interaction of the peptide with the peptide-binding groove of the MHC
class II protein, the interaction of the peptide-class II complex with
the TCR, or both.

Abbreviations used

APC        Antigen-presenting cell
CD40L      CD40 ligand
EBV-LCL    Epstein-Barr virus-transformed lymphoblastoid B-cell lines
IFN        Interferon
mAb        Monoclonal antibody
MHC        Major histocompatibility complex
TCR        T-cell receptor
Th2        T-helper type 2 (cells)

In addition, there is increasing evidence that substitution of certain
amino acids of immunogenic peptides will induce qualitative changes in
T-cell responses as a result of altered signaling by the TCR upon its
interaction with the peptide-MHC complex. Stimulation of mouse T cells
with antigenic peptides containing single amino acid substitutions
reportedly resulted in TCR antagonism, segregation of proliferative
and cytokine responses, or even induction of anergy. Also, in human
systems, stimulation of T-cell clones with altered peptides has been
shown to result in changes in T-cell responses. Peptides with a single
amino acid substitution, which acted as TCR antagonists, were found to
selectively induce the production of transforming growth factor-β1 by
myelin basic protein-specific human T-cell clones but were no longer
able to induce the production of IL-2, IL-4, IL-10, and interferon
(IFN)-γ by these cells. Recently, it was reported that stimulation of
T-cell clones with altered peptides, derived from dominant epitopes on
the house dust mite allergen Der p 2 or the Japanese cedar pollen
allergen Cry j 1, respectively, resulted in the selective enhancement
of IFN-γ but not of IL-4 production or induction of cytokine
production in the absence of proliferation. The above-mentioned
studies show that single amino acid substitutions in T-cell
activation-inducing epitopes on major allergens may modulate the
cytokine production profile of allergen-specific T cells. However, no
information has been provided on the capacity of these peptides to
inhibit in vitro IgE synthesis. Moreover, the effects of modified
peptides on T cell-B cell-mediated interactions, which are required
for productive IgE synthesis, have not been reported yet.

Therefore in this study we have investigated the capacity of peptides,
derived from two minimal T-cell activation-inducing epitopes on the
major house dust mite allergen Der p 1 and modified by single amino
acid substitutions, to block the activation of human Der p 1-specific
T-cell clones and to inhibit the production of Th2 cytokines, the
expression of CD40L, and T cell-mediated IgE synthesis by B cells in
vitro.

METHODS 
Cells 

The human CD4+ Th2-like, HLA-DR1101-restricted T-cell clones NP-12 and
NP-44, specific for the p94-104 sequence of Der p 1, have been
described previously. The CD4+ Th2 clone AR-142 recognizes the Der p
1-derived peptide p171-182 in the context of HLA-DR1501 (S. Fasler,
unpublished results). All Der p 1-specific T-cell clones were
generated by stimulation of peripheral blood mononuclear cells,
obtained from atopic patients who were allergic to Der p 1, with 1
μg/ml of a lyophilized extract of Der p 1 (ALK, Bornholm, Denmark),
followed by cloning by using a limiting dilution method, as has been
described. The T-cell clones were cultured at a concentration of 2 ×
10^ cells/ml in 24-well Linbro plates (Flow, McLean, Va.) after
stimulation with a feeder cell mixture consisting of 10^ cells/ml
irradiated (40 Gray) allogeneic peripheral blood mononuclear cells,
10^ cells/ml of the irradiated (50 Gray) Epstein-Barr
virus-transformed lymphoblastoid B cell line (EBV-LCL) JY, and 0.1
μg/ml PHA (Wellcome Diagnostics, Dartford, U.K.), as described
previously. Three to 4 days after each restimulation with feeder
cells, the cultures were expanded in culture medium containing 20 U/ml
rIL-2 and 200 U/ml rIL-4 (kindly provided by Dr. G. Zurawski, DNAX
Research Institute). All cloned T-cell lines and EBV-LCL were free of
mycoplasma and were cultured in Yssel's medium, supplemented with 1%
human serum. Experiments were carried out in the same culture medium
unless indicated otherwise. HLA-DR1101-matched monocytes to be used as
APC for NP-44 cells (generously provided by Drs. C. Figdor and R.
Huybens, Academisch Ziekenhuis Nijmegen, The Netherlands) were
isolated from the peripheral blood of a healthy donor by centrifugal
elutriation as described previously.

Proliferation assay 

Cellular proliferation was measured with a [3H]TdR assay as
has been previously described. In brief, 10^ T cells were mixed
with 5 × 10^ irradiated (40 Gy) HLA-matched monocytes or
5 × 10^ irradiated (50 Gy) autologous EBV-LCL that had been
pulsed for 3 hours with specific peptide at concentrations
ranging from 0.1 to 10 μg/ml in a flat-bottomed, 96-well tissue
culture plate (Flow Laboratories) in a total volume of 200 μl.
After 72 hours of incubation at 37° C, 37 kBq [3H]TdR
(Amersham, Arlington Heights, Ill.) was added to each well. Six
hours later, the cells were harvested onto glass fiber strips
and the amount of incorporated [3H]TdR was measured by liquid
scintillation counting. The results are expressed as the mean
of triplicate cultures ± the standard deviation. Inhibition of
proliferation by substituted peptides was calculated by the
following formula:

$$
\frac{\text{proliferation with wild-type peptide} - \text{spontaneous proliferation}}
     {\text{proliferation with antagonistic and wild-type peptide} - \text{spontaneous proliferation}}
\times 100\%
$$
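
The background correction and the triplicate summary described above can be sketched in a few lines of Python. This is a minimal sketch, not the authors' procedure: it assumes that "inhibition" is meant in the usual sense, that is, the fractional loss of background-corrected proliferation when the substituted peptide is added to the wild-type peptide (the printed expression may be arranged differently). The function names and the counts are hypothetical and serve only as illustration.

```python
from statistics import mean, stdev


def summarize(cpm):
    """Return (mean, sample standard deviation) of replicate counts."""
    return mean(cpm), stdev(cpm)


def percent_inhibition(wild_type, with_antagonist, spontaneous):
    """Percent inhibition from replicate counts (cpm).

    Assumed definition (not taken verbatim from the paper):
    100 * (1 - (mixture - background) / (wild type - background)).
    """
    background = mean(spontaneous)
    wt_net = mean(wild_type) - background
    mix_net = mean(with_antagonist) - background
    if wt_net <= 0:
        raise ValueError("wild-type response does not exceed background")
    return 100.0 * (1.0 - mix_net / wt_net)


# Hypothetical triplicate counts, invented for illustration only.
wild_type = [20500, 19800, 20300]
with_antagonist = [6400, 6100, 6200]
spontaneous = [210, 190, 200]

inhibition = percent_inhibition(wild_type, with_antagonist, spontaneous)
wt_mean, wt_sd = summarize(wild_type)
```

With this convention a value of 0% means the substituted peptide had no effect and 100% means proliferation fell to the spontaneous level.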

Stimulation of T-cell clones and assays for 
cytokine production 

One million T cells per milliliter were cocultured with
2 × 10^ autologous EBV-LCL that had been pulsed for 3 hours
with specific peptide (0.1 to 10 μg/ml) in 24-well Linbro tissue
culture plates. After 48 hours of activation, supernatants were
harvested and stored at -80° C before testing. Cytokine levels
in the supernatants of activated T cells were determined by
ELISA, with cytokine-specific antibodies for IL-2, IL-4, IFN-γ,
and IL-5, as has been described.''' The sensitivity of each assay
was approximately 50 pg/ml.

TABLE I. Substituted peptides derived from two native Der p 1 sequences

p171-182
Position       171  172  173  174  175  176  177  178  179  180  181  182
Wild-type       A    V    N    I    V    G    Y    S    N    A    Q    G
Substitution    G    A    A    A    A    A    A    A    A    G    A    A
Effect*         Ac   Ac   An   Ac   No   No   Ac   No   Ac   No   An   Ac
Additional substitutions at position 173: K, G, W, I, D, S

p94-104
Position        94   95   96   97   98   99  100  101  102  103  104
Wild-type       Y    R    Y    V    A    R    E    Q    S    C    R
Substitution    A    A    A    A    G    A    A    A    A    A    A
Effect*         Ac   An   An   Ac   Ac   Ac   An   No   Ac   Ac   Ac

Ac, Activation-inducing; No, nonstimulatory; An, antagonistic.
*Effect of substituted peptide.

Analysis of CD40L expression by flow cytometry 

One million T cells per milliliter were stimulated with
2 × 10^ irradiated (50 Gy), autologous EBV-LCL that had been
pulsed for 3 hours with 1 μg/ml of specific immunogenic peptide
in the presence or absence of a 100-fold molar excess of
substituted peptide for 16 hours. To prevent downregulation of
CD40L on activated T cells as a result of interaction with CD40
expressed on the APC, 1 μg/ml of anti-CD40L monoclonal antibody
(mAb) was added to the cultures (R. Noelle, personal
communication). Methods of immunofluorescence staining, flow
cytometry, and data analysis have been described by Lanier and
Recktenwald.^^ Flow cytometry to analyze the expression of CD40L
on activated T-cell clones was performed with a FACS Calibur
flow cytometer (Becton Dickinson Co., San Jose, Calif.) and the
anti-CD40L mAb LL48 (CD154, kindly provided by Dr. J.
Banchereau, Schering-Plough Research Laboratories, Dardilly,
France).
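
Because peptide doses are given in μg/ml while the excess of substituted peptide is defined in moles, the two molecular masses enter the conversion. The sketch below illustrates that arithmetic for p94-104 and its Y96A analogue. It assumes the sequence YRYVAREQSCR for p94-104 (as read from Table I), free N- and C-termini, and standard average residue masses; it ignores counter-ions and the approximately 90% purity reported for the synthetic peptides. The function names are hypothetical.

```python
# Average residue masses (Da) of the 20 standard amino acids.
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per peptide accounts for the free termini


def peptide_mass(sequence):
    """Average molecular mass (Da) of a linear, unmodified peptide."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in sequence) + WATER


def substitute(sequence, first_position, position, new_residue):
    """Replace the residue at `position`, numbered as in the allergen."""
    index = position - first_position
    return sequence[:index] + new_residue + sequence[index + 1:]


def excess_concentration(wt_ug_per_ml, wt_mass, sub_mass, fold):
    """μg/ml of substituted peptide giving a `fold` molar excess."""
    return fold * wt_ug_per_ml * sub_mass / wt_mass


P94_104 = "YRYVAREQSCR"                 # assumed sequence, residues 94-104
Y96A = substitute(P94_104, 94, 96, "A")

wt_mass = peptide_mass(P94_104)
y96a_mass = peptide_mass(Y96A)
y96a_dose = excess_concentration(1.0, wt_mass, y96a_mass, 100)
```

Since a single-residue analogue has nearly the same mass as the parent peptide, a 100-fold molar excess over 1 μg/ml of wild-type peptide corresponds to somewhat less than 100 μg/ml of Y96A under these assumptions.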

Measurement of Ig production 

One million T cells per milliliter were stimulated with 2 × 10^
irradiated (50 Gy), autologous EBV-LCL (AR-142) or 2.5 × 10^
HLA-matched monocytes (NP-44) that had been pulsed for 3 hours
with 1 μg/ml of specific immunogenic peptide in the presence or
absence of a 100-fold molar excess of substituted peptide, and
culture supernatants were harvested after 24 hours of culture.
The supernatants were added to 10^4 highly purified human B
lymphocytes (> 99% CD20+), isolated from normal spleens,^^ which
were then stimulated with the anti-CD40 mAb 89^^^ in 96-well
round-bottomed plates (Flow) in Yssel's medium, supplemented
with 10% fetal calf serum and 10 μg/ml ultrapure transferrin
(Pierce, Rockford, N.Y.) in a final volume of 200 μl. For T
cell-mediated IgE synthesis, T-cell clone NP-44 was activated
for 24 hours with HLA-matched monocytes, pulsed for 3 hours with
1 μg/ml of p94-104 in the presence or absence of a 100-fold
molar excess of Y96A, and after washing of the cells, 10^ T
cells were added to 10^ purified B cells in the presence of
exogenous rIL-4 (100 U/ml). Fourteen days later, the
supernatants from triplicate cultures of B cells were harvested,
pooled, and analyzed for Ig isotype content by ELISA as
described previously." The sensitivity of the ELISA was
determined with a calibrated standard from Behring (Marburg,
Germany) and was 0.2 ng/ml.

Generation of Der p 1-derived peptides 

Der p 1-derived peptides, representing minimal T-cell
activation-inducing epitopes or including single substitutions,
were synthesized on an Advanced Chemtech 350 synthesizer
(Advanced Chemtech, Louisville, Ky.). The carboxyl-terminal
amino acid coupled to Wang resin was used as the solid phase,
and the peptides were extended with FMOC chemistry, according
to the manufacturer's instructions. Peptides were characterized
by laser desorption mass spectrometry (FinniganMat, San Jose,
Calif.), peptide sequencing, and quantitative amino acid
analysis and had a purity of approximately 90%.

FIG. 1. Effect of amino acid substitutions in peptide p171-182 on
proliferation and cytokine production of Der p 1-specific T-cell
clone AR-142. T cells were stimulated with various concentrations
of wild-type and substituted peptides, in presence of autologous
EBV-LCL, and proliferative responses (A) were measured after 72
hours as described in Methods section. Production of IL-2, IL-4,
and IFN-γ (B) was analyzed after incubation of T cells with
1 μg/ml of peptide and EBV-LCL for 48 hours.



RESULTS 

Mutational analysis within minimal T-cell 
activation-inducing epitopes 

The T-cell clones NP-12, NP-44, and AR-142 specifically
proliferate and secrete cytokines in response to stimulation
with the immunogenic, Der p 1-derived wild-type peptides p94-104
and p171-182, respectively, in the presence of autologous or
HLA-matched APCs.' To generate modified peptides that could
block allergen-specific activation of these T-cell clones,
single amino acid substitutions were introduced into the
immunogenic peptide sequences (Table I). Stimulation of the
T-cell clones with autologous EBV-LCL in the presence of various
concentrations of substituted Der p 1-derived peptides resulted
in various levels of T-cell proliferation as well as cytokine
production. Maximal proliferative responses to wild-type peptide
were obtained at approximately 1 μg/ml of peptide, as shown in
Fig. 1 and Fig. 2. Peptides derived from p171-182 with
substitutions of amino acid residues at positions 173, 175, 176,
180, and 181 with Ala (A) or Gly (G) failed to induce
proliferative responses and production of IL-2, IL-4, and IFN-γ
by clone AR-142 (Fig. 1). In addition, peptides in which the
neutral Asn (N) residue at position 173 was replaced by either a
basic [Lys:K], hydrophobic [Trp:W, Ile:I], acidic [Asp:D], or
hydrophilic residue [Ser:S] also did not induce proliferation
and cytokine production of AR-142 (results not shown),
indicating that Asn at position 173 is one of the critical
residues required for T-cell activation. Similarly,
substitutions with Ala or Gly at positions 95, 96, 100, and 101
of peptide 94-104 resulted in modified peptides that were
ineffective in inducing proliferation and IL-2, IL-4, and IFN-γ
production by T-cell clones NP-12 and NP-44 (Fig. 2). The lack
of proliferation-inducing capacity of these peptides was always
associated with the inability to induce cytokine production, and
no dissociation between proliferation and cytokine production or
between the production of IL-2, IL-4, and IFN-γ was observed.
Substitutions at amino acid positions other than those mentioned
above induced normal or only slightly reduced proliferative
responses and cytokine production of the T-cell clones compared
with those induced with the native, nonsubstituted peptides.

Inhibition of proliferation by antagonistic
peptides

Next we determined whether the substituted peptides that failed
to induce T-cell activation could act as antagonists by
inhibiting the wild-type peptide-induced proliferation of the
Der p 1-specific T-cell clones. Stimulation of T-cell clone
AR-142 with wild-type peptide p171-182 and a 10-fold molar
excess of the substituted peptides N173A or Q181A resulted in an
inhibition by more than 50% of the proliferation induced by
p171-182 alone (Fig. 3). Maximal inhibition of proliferative
responses of approximately 70% was observed with a 100-fold
molar excess of antagonistic peptides. Three nonstimulatory,
modified peptides, generated by single amino acid substitutions
in the p94-104 peptide, had strong inhibitory activities on the
proliferation of T-cell clones NP-12 and NP-44. Substituted
peptides R95A, Y96A, and Q101A strongly inhibited the
proliferation of T-cell clone NP-12 in a dose-dependent manner.
Weaker inhibitory effects of these peptides were observed on
proliferative responses of T-cell clone NP-44. Preincubation of
the APCs for 6 hours with a 100-fold molar excess of
antagonistic peptides before addition of the wild-type peptide
yielded similar inhibitory effects (S. Fasler, unpublished
results).

The substituted peptides that inhibited Der p 1-induced
proliferation of T-cell clone AR-142 were not able to block
proliferative responses of T-cell clones NP-12 and NP-44 and
vice versa, indicating that the inhibitory effect of these
peptides was not due to toxic effects. All other substituted
peptides with stimulatory activity listed in Table I did not
affect Der p 1-specific proliferative responses of the T-cell
clones at any concentration used (data not shown). Further
mutational analysis of residue 173 in the wild-type peptide
p171-182, after substitution of Asn with Lys, Gly, Trp, Ile,
Asp, and Ser, produced peptides with varying antagonistic
activities. The inhibition of the Der p 1-induced proliferation
of T-cell clone AR-142, at a 100-fold molar excess of
substituted peptide, ranged from 9% to 78%, with decreasing
inhibitory activity in the order Ala, Ser, Gly, Lys, Trp, Ile,
and Asp (data not shown).

FIG. 2. Effect of amino acid substitutions in peptide p94-104 on
proliferation and cytokine production of Der p 1-specific T-cell
clones NP-12 and NP-44. Proliferative responses (A) and cytokine
production (B) of T-cell clones NP-12 and NP-44 were measured as
described in Fig. 1.

Inhibition of cytokine production by antagonistic 
peptides 

To further examine the effects of substituted peptides on T-cell
effector function, the cytokine production by Der p 1-specific
T-cell clones, induced by the wild-type peptides in the presence
of an excess amount of modified peptide, was studied. As is
shown in Table II, the antagonistic peptides inhibited the
proliferative responses of the Der p 1-specific T-cell clones in
a dose-dependent fashion with a maximal inhibitory effect at a
100-fold molar excess. Therefore this concentration of peptide
was used in further experiments. The antagonistic peptides R95A,
Y96A, and E100A, which inhibited the proliferation of T-cell
clones NP-12 and NP-44, also strongly blocked the production of
IL-2 and IFN-γ at a 100-fold molar excess. Comparable results
were obtained with the antagonistic peptides N173S, Q181A, and
N173A, which had strong inhibitory activities on the
proliferative responses of T-cell clone AR-142. However, the
relatively high production levels of IL-4 and IL-5, in
particular those produced by T-cell clones NP-44 and AR-142,
were only partially affected (Table II), even in the presence of
a 500-fold molar excess of antagonistic peptide (data not
shown), suggesting a difference in sensitivity of the TH1 versus
the TH2 cytokines to the inhibitory effects of the antagonistic
peptide. However, none of the substituted peptides had a
differential effect on the production of the various cytokines;
furthermore, antagonistic peptides that inhibited cytokine
production also inhibited proliferative responses.

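The difference in sensitivity between the cytokine groups can be made concrete by averaging the percent-inhibition values of Table II for each clone. The sketch below is illustrative only: the numbers are transcribed from Table II (the isoleucine analogue is entered as N173I), while the grouping into IFN-γ/IL-2 versus IL-4/IL-5 and the unweighted mean are choices made here, not an analysis reported by the authors.

```python
from statistics import mean

# Percent inhibition at a 100-fold molar excess, transcribed from Table II.
# Tuple order: (IFN-gamma, IL-2, IL-4, IL-5).
TABLE_II = {
    "AR-142": {
        "Q181A": (86, 100, 51, 34),
        "N173A": (93, 100, 50, 37),
        "N173S": (94, 100, 50, 29),
        "N173G": (76, 100, 44, 11),
        "N173K": (68, 100, 29, 4),
        "N173I": (63, 89, 27, 5),
        "N173W": (56, 73, 28, 3),
        "N173D": (37, 12, 6, -12),
    },
    "NP-12": {
        "R95A": (44, 99, 69, 81),
        "Y96A": (70, 95, 75, 92),
        "E100A": (90, 100, 86, 99),
    },
    "NP-44": {
        "R95A": (98, 100, 46, 50),
        "Y96A": (101, 100, 44, 60),
        "E100A": (98, 100, 43, 78),
    },
}


def group_means(rows):
    """Mean inhibition of (IFN-gamma, IL-2) and of (IL-4, IL-5)."""
    ifn_il2 = [v for ifn, il2, _, _ in rows.values() for v in (ifn, il2)]
    il4_il5 = [v for _, _, il4, il5 in rows.values() for v in (il4, il5)]
    return mean(ifn_il2), mean(il4_il5)


SUMMARY = {clone: group_means(rows) for clone, rows in TABLE_II.items()}

for clone, (ifn_il2, il4_il5) in SUMMARY.items():
    print(f"{clone}: IFN-gamma/IL-2 {ifn_il2:.1f}%, IL-4/IL-5 {il4_il5:.1f}%")
```

On these values the gap between the two groups is large for NP-44 and AR-142 and essentially absent for NP-12, in line with the statement that IL-4 and IL-5 were only partially affected in particular in NP-44 and AR-142.
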
Antagonistic peptides inhibit help for IgE 
synthesis by B cells by blocking IL-4 production 
and expression of CD40L 

Activated Der p 1-specific T-cell clones give help to purified B
cells for the synthesis of IgE through their production of IL-4,
as well as through a signal mediated by the interaction of CD40
and its ligand CD40L. In view of the inhibitory effects of the
antagonistic peptides on the cytokine production of Der p
1-specific T-cell clones, we investigated whether these peptides
could inhibit T cell-mediated induction of IgE synthesis by
purified B cells in vitro. As demonstrated previously, the
addition of culture supernatants of T-cell clone NP-44,
activated with peptide p94-104 and APCs, to highly purified B
cells resulted in the induction of high levels of polyclonal IgE
(Fig. 4).34 Supernatants collected from cultures of NP-44 cells,
stimulated with the wild-type peptide p94-104 and a 100-fold
molar excess of Y96A, showed a reduction of approximately 50% in
their capacity to induce IgE synthesis in vitro by B cells
preactivated with anti-CD40 mAb (Fig. 4, A). Similar results
were obtained with T-cell clone AR-142 and N173A (Fig. 4, B),
reflecting the partial inhibition of IL-4 production by these
antagonistic peptides. To determine whether such antagonistic
peptides could also block IgE synthesis mediated by activated
T-cell clones in vitro, purified B cells were cocultured with
NP-44 cells, stimulated with p94-104, a 100-fold molar excess of
substituted peptide Y96A, or combinations of p94-104 and Y96A,
and IgE production was analyzed after 12 days of culture. As
shown in Fig. 4, C, NP-44 cells, stimulated with p94-104 and
excess Y96A, were no longer able to provide help to B cells for
the production of IgE, even in the presence of exogenous IL-4.
To examine whether the presence of antagonistic peptides can
interfere with T cell-B cell-mediated signals leading to IgE
synthesis, their effect on the induction of CD40L expression was
analyzed. T-cell clones NP-44 and AR-142 expressed significant
levels of CD40L, as detected by flow cytometry, 6 hours after
stimulation of the cells with APCs and wild-type peptide. As
shown in Fig. 5, the presence of a 100-fold molar excess of Y96A
or N173A during antigen-specific stimulation of T-cell clones
NP-44 and AR-142 strongly inhibited the induction of CD40L
expression. Taken together, these results suggest that
antagonistic peptides block Ig isotype switching and subsequent
IgE production by B cells in vitro by decreasing IL-4 production
by Der p 1-specific T-cell clones and by interfering with the
expression of CD40L and possibly other surface molecules on T
cells that are required for optimal T cell-B cell interaction.

... proliferation and represent mean and standard deviation of
three independent experiments.

TABLE II. Inhibition of cytokine production of Der p 1-specific
T-cell clones by substituted Der p 1-derived peptides

                                % Inhibition of cytokine production†
T-cell clone*   Peptide     IFN-γ    IL-2    IL-4    IL-5
AR-142          Q181A         86      100      51      34
                N173A         93      100      50      37
                N173S         94      100      50      29
                N173G         76      100      44      11
                N173K         68      100      29       4
                N173I         63       89      27       5
                N173W         56       73      28       3
                N173D         37       12       6     -12
NP-12           R95A          44       99      69      81
                Y96A          70       95      75      92
                E100A         90      100      86      99
NP-44           R95A          98      100      46      50
                Y96A         101      100      44      60
                E100A         98      100      43      78

*T cells were stimulated with autologous EBV-LCL, pulsed with
1 μg/ml of wild-type peptides p171-182 (AR-142) or p94-104
(NP-12 and NP-44), respectively, and a 100-fold molar excess of
substituted peptides as described in Methods section.
†Cytokine production was measured after 48 hours of culture as
described in Methods section. Values for cytokine production
after stimulation with wild-type peptides are similar to those
shown in Figs. 1 and 2.

FIG. 4. Inhibition of T-cell help for IgE synthesis by
antagonistic peptides. Ten thousand highly purified B cells
(> 99%) were incubated with anti-CD40 antibody (1 μg/ml) and
cultured with 25% dilution of culture supernatants from T-cell
clones NP-44 (A) and AR-142 (B) that had been stimulated with
wild-type peptide (1 μg/ml), APCs, and a 100-fold excess of
antagonistic peptides for 24 hours as described in Methods
section. Alternatively, 10^ NP-44 cells that had been stimulated
as described in A were washed twice with medium and cultured
with highly purified B cells in presence of 100 U/ml rIL-4 (C).
After 14 days, IgE levels in B-cell cultures were measured by
ELISA as described in Methods section.

DISCUSSION 

In this study we demonstrate that certain modified peptide
sequences, derived from native T-cell activation-inducing
sequences in the major house dust mite allergen Der p 1, fail to
induce proliferative responses and cytokine production of Der p
1-specific TH2-like clones. In addition, when added in excess,
they strongly inhibit allergen-induced activation, resulting in
the inability of these T-cell clones to give help to B cells for
the synthesis of IgE. An overview of the effects of the
substituted peptides on T-cell activation is shown in Table I.
These peptides were derived from the p94-104 and p171-182
sequences, which are minimal T-cell activation-inducing epitopes
on Der p 1 and are recognized in the context of HLA-DR11 and
HLA-DR15, respectively. Single substitutions of Arg95, Tyr96,
Glu100, and Gln101 in p94-104 abolished proliferative responses
of T-cell clones NP-12 and NP-44. Similarly, stimulation of
T-cell clone AR-142 with modified peptide p171-182 containing
substitutions at Asn173, Val175, Gly176, Ser178, Ala180, and
Gln181 by Ala did not induce a proliferative response when used
at similar concentrations as the nonsubstituted peptide. In
addition, stimulation of the T-cell clones with these modified
peptides also failed to induce the production of cytokines.
These results indicate that the above-mentioned residues are
critical for recognition of p94-104 and p171-182 by the TCR or
may function as anchors involved in binding to agretopes on
HLA-DRB1*1101 and DRB1*1501, respectively. The failure of some
of the substituted peptides to induce T-cell activation might be
due to an impaired interaction of these peptides with the TCR,
with HLA-DR molecules, or both. Future studies with MHC-binding
analysis may provide an answer to this question.

The Der p 1-specific T-cell clones used in this study have a
TH2-like cytokine production profile, secreting high levels of
IL-4 and IL-5 and low levels of IFN-γ upon activation. The
production of the latter cytokine probably is due to the
presence of IL-12 in the cultures, which has been shown to
induce the production of IFN-γ by polarized TH2 cells.^^'^^
Furthermore, IL-2, which is produced by human TH1 as well as TH2
clones, was produced at detectable levels. Stimulation of the
T-cell clones with the antagonistic peptides resulted in a
general inhibition of activation and did not induce a
segregation of proliferation and cytokine production, as has
been reported for murine^^'^^ and certain human TH0 clones.^^ In
addition, incubation of the Der p 1-specific T-cell clones with
the modified peptides, at concentrations required for optimal
T-cell proliferation, did not induce a state of unresponsiveness
as shown previously for altered peptides derived from
ovalbumin.^^ Our results also partially contrast with those of
Tsitoura et al.^^ and Ikagawa et al.,^^ who reported that
stimulation of allergen-specific T-cell clones with altered
peptides resulted in enhanced production of IFN-γ but did not
affect the production of IL-4. In view of the latter studies
showing that single amino acid substitutions can alter the
profile of cytokine production by T-cell clones, the effects of
amino acid residues other than the Ala substitution were studied
in the antagonistic peptide N173A. The initial substitution with
the neutral amino acid residue, Ala or Ser, produced the most
efficient antagonists, whereas substitution with the hydrophobic
Ile or acidic Asp was less effective in inhibiting cytokine
production (data not shown). However, none of these other
substitutions at position 173, albeit resulting in the
generation of peptides with antagonistic activity, selectively
inhibited cytokine production or proliferation alone or changed
the TH2-like cytokine secretion pattern of the T-cell clones.

FIG. 5. Inhibition of CD40L expression by antagonistic peptides.
NP-44 and AR-142 cells were stimulated with autologous EBV-LCL
and wild-type peptides p94-104 or p171-182, respectively, in
absence (A) or presence (B) of a 100-fold molar excess of
antagonistic peptide as described in Methods section. CD40L
expression was analyzed by flow cytometry after 6 hours of
culture. The x- and y-axes represent fluorescence (four-decade
log scale) and relative cell number, respectively. CD40L
expression on resting, nonstimulated T-cell clones is shown in
histogram (C).

Among the substituted peptides that failed to induce activation
of the Der p 1-specific T-cell clones, three of four derived
from p94-104 and two of six derived from p171-182 were found to
have strong antagonistic activity (Tables I and II and Fig. 3).
Incubation of the T-cell clones with the nonsubstituted
immunogenic peptide in the presence of an excess of these
antagonistic peptides resulted in a strong inhibition of
proliferation as well as of the production of IL-2 and IFN-γ.
Because the proliferative responses of the Der p 1-specific
T-cell clones used in this study are strictly dependent on the
presence of IL-2 despite their TH2-like phenotype (H. Yssel,
unpublished results), the inhibitory effects of the antagonistic
peptides are likely to be the result of the strong inhibition of
IL-2 production as well as of CD25 expression (data not shown).
Interestingly, however, the TCR antagonists had less effect on
IL-4 and IL-5 production induced by the nonsubstituted native
peptides. For example, peptide analogues N173A and Q181A,
derived from the p171-182 molecule, strongly inhibited
proliferative responses and IL-2 production already at a 10-fold
molar excess over the native peptide, whereas the presence of a
100-fold molar excess of these antagonists resulted in a
decrease of IL-4 and IL-5 production of less than 50%. Although
these results do not show an uncoupling between proliferation
and cytokine production, they do not rule out the possibility
that the antagonistic peptides may have differential effects on
signal transduction by the TCR. In particular, the low
inhibitory effects of the antagonistic peptides on IL-4
production are in line with previously published observations,
suggesting that the synthesis of IL-4 is less dependent on
costimulatory signals, as demonstrated by the ability of mouse
TH2 clones to continue to produce IL-4 when stimulated with
chemically modified APCs.^i Moreover, IL-4 production seems to
be favored by signals resulting from less potent TCR-ligand
interactions, such as those after stimulation of T cells with
peptides that have a low affinity for the TCR or for MHC class
II molecules.^i In addition, it has been reported that mouse TH1
and TH2 clones use different T-cell signaling pathways,
suggesting that TCR-mediated signaling is likely to differ
between IL-4 and IFN-γ production.^^ On the other hand, because
the antagonistic peptides, when used by themselves, had an "all
or none" effect on the cytokine production profile of the Der p
1-specific T-cell clones, it cannot be excluded that the
observed differences in sensitivity between the various
cytokines, after stimulation of the cells in the presence of
antagonist and native peptide, may be the result of differences
in the threshold of the activation signal or signals mediated by
the TCR, rather than differences in signaling pathways.

The synthesis of IgE by B cells has been shown to be
specifically induced by IL-4 and IL-13 in the presence of a
signal provided by the interaction of CD40, expressed on B
cells, and CD40L, expressed on activated T cells.^-^ Moreover,
IL-4/IL-13-induced IgE synthesis is inhibited by IFN-γ.^ Culture
supernatants of T-cell clones NP-44 and AR-142 that had been
activated with specific peptide in the presence of excess
antagonistic peptides Y96A and N173A, respectively, were still
able to induce anti-CD40-driven IgE synthesis by B cells.
However, levels of IgE synthesis were inhibited by approximately
50% to 75%, compared with those induced by T-cell supernatants
generated after stimulation of the T-cell clones with native
peptides alone. These results reflect the partial inhibition of
IL-4 and, most likely, IL-13 production by the T-cell clones. In
addition, T-cell clone NP-44, activated in the presence of
native peptide p94-104 and excess antagonistic peptide Y96A, was
no longer able to give help to B cells for the synthesis of IgE,
even in the presence of exogenous IL-4. The inhibitory effects
of these antagonistic peptides on T cell-mediated IgE synthesis
appear to be the result of their capacity to downregulate or
prevent the induction of CD40L expression on the T cells.
Because these results underscore the notion that the
antagonistic peptides used in this study do not appear to induce
uncoupling of TCR-mediated signals leading to proliferation,
cytokine synthesis, and expression of cell surface molecules, it
is likely that the expression of other cell surface molecules
that may be associated with productive T cell-B cell
interaction, such as membrane-bound tumor necrosis factor
(TNF)-α,^^ is downregulated as well. Taken together, these
results demonstrate that antagonistic Der p 1-derived peptides
are able to effectively inhibit TH2 cell-mediated IgE synthesis
by decreasing the production of the IgE synthesis-inducing
cytokine IL-4, as well as by inhibiting the functional
expression of cell surface molecules, which are important for
productive T cell-B cell interactions.

In conclusion, in this study we describe the generation of
antagonistic peptides, after the substitution of single amino
acid residues, that effectively interfere with the activation of
and help for IgE synthesis by Der p 1-specific T-cell clones.
Although the activation of the T-cell clones in this study was
carried out with wild-type Der p 1-derived peptides, identical
inhibitory effects of the antagonistic peptides were also
observed when the T-cell clones had been activated with the
native Der p 1 allergen (data not shown). In this respect, these
peptides may be of potential interest for therapeutic use. It
must be noted, however, that house dust mite extracts consist of
a large number of major allergens, each containing multiple
T-cell activation-inducing epitopes.'^- ^^'^^ In addition,
allergens are recognized in the context of different HLA class
II molecules, which hampers efforts to design immunotherapy
protocols with the aim of inactivating allergen-specific T
cells. It is therefore important to state that despite their
effectiveness in inhibiting IgE synthesis in vitro, more
information is required to evaluate the potential use of
antagonistic allergen-derived peptides in immunotherapy.

Summary: Exciting advances in biotechnology have led to the development of
innovative biological products to improve health care. In this review article,
the major techniques of product development and manufacturing in biotechnology
are discussed, with a focus on recombinant DNA technology. In addition,
monoclonal antibody, nucleotide blockade, polymerase chain reaction,
antisense, and gene therapy technologies will be defined briefly. For
recombinant DNA technology, the issues of gene isolation, gene cloning, protein
expression, scale-up (manufacturing), and quality assurance are addressed.
The 16 approved products and the research pipeline are characterized. In
addition, major usage issues for biological products are noted. This review
serves as an introduction to the science and applications of biotechnology,
present and future. Key Words: Biotechnology; Recombinant DNA technology;
Biological products.



During the past two decades, biotechnology has evolved from a
laboratory science to a dynamic research-based business and an
established form of therapy that benefits patients every day.
The sophisticated science of biotechnology continues to unfold
and is expanding rapidly, from the core technologies of
recombinant DNA procedures and monoclonal antibodies to whole
new techniques, such as antisense (anti-RNA code blockade) (1).
The biotechnology revolution is a success because of the
scientific ingenuity and entrepreneurial spirit of molecular
biologists, biochemists, and investment capitalists, who have
formed > 1,000 biotechnology research companies in the United
States that develop biological products to mitigate the effects
of diseases in new ways.

Two indispensable factors in this success have been substantial
research collaboration with universities and research and
regulatory cooperation with government agencies. A 1991 survey
showed that 742 biotechnology firms had ~$12 billion in revenue
(2). In 1992, the top 79 biotechnology companies, according to
stock market analysts, generated $2.127 billion in revenue (3),
which was corroborated in a recent review article (13 molecules
were responsible for $2.14 billion in sales in 1992) (4). The
research and development commitment is impressive as well; 742
companies (1991 survey) reportedly invested > $3.63 billion in
research (~30% of revenues) (2). From 1990 to the year 2000,
the U.S. biotechnology market is projected to grow tenfold,
including pharmaceutical, diagnostic, agricultural, and
environmental products. This review article focuses on the
scientific principles of biotechnology, product development
issues, and products currently in use and under development.
DEFINITIONS 

The molecules or biological products created by
biotechnology research and manufacturing to date
are all proteins. The manufacturing process for
these molecules uses a living system, usually a cell;
this cell produces all of its normal proteins plus one
additional protein, very similar to a naturally
occurring human molecule. Van Noordwijk offered a
broadly applicable, yet simple definition of
biotechnology: "the use of tissue cultures, living cells or
cell enzymes to make a defined product" (5). More
precise definitions exist and often delineate specific
processes that make up biotechnology, such as
hybridoma, cell fusion, and recombinant DNA.
Furthermore, biotechnology can be described as a
composite of several key scientific disciplines:
molecular biology, microbiology, biochemistry,
immunology, genetics, and engineering.

In the brief time period from 1982 to 1993, bio- 
logical products have become widely available (15 
molecules and 21 products by the fall of 1993). 
These products are being used in hundreds of thou- 
sands of patients annually for such common mala- 
dies as diabetes, cancer, heart attacks, and anemia. 
Biotherapy is a term now used to characterize the 
clinical use of these products. Its definition encom- 
passes the administration of proteins, which often 
are duplicates or functional equivalents of naturally 
occurring molecules in the human body, in order to 
enhance normal body functions or replace deficient 
or dysfunctional proteins. Biotherapy is being inte- 
grated into patient care along with the established 
therapies of drugs, diet, surgery, physical proce- 
dures, radiation, and psychotherapy. 

DESCRIPTION OF RECOMBINANT 
DNA TECHNOLOGY 

Molecular biology is based on the central tenet 
that DNA makes RNA makes protein (6). This pro- 
cess comprises three major steps: transcription, 
translation, and completion of the protein. Genes 
are DNA nucleotide sequences, the basic units of 
heredity, and the starting material for recombinant 
DNA (r-DNA) technology. In the 23 pairs of chro- 
mosomes in the nucleus, made up of three to four 
billion base pairs of nucleotides, we need to locate 
the one gene of 50,000 genes responsible for pro- 
ducing a specific protein. In addition, genes contain 
coding DNA sequences (exons) responsible for pro- 
tein production; a leader DNA sequence to initiate 
transcription; noncoding DNA material (introns), 
which must be eliminated in the formation of mes- 
senger RNA (m-RNA); and nucleotide sequences 
that indicate the terminal end of a gene and turn off
transcription. The end product of r-DNA technology
is a protein, which includes a set of amino acids
of the correct number, type, and sequence. Several 
structural requirements are also critical for optimal 
activity of proteins; these include disulfide cross- 
linkage, three-dimensional folding, and glycosyla- 
tion with sugar moieties of the right number, loca- 
tion, and type. 
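
The flow from gene to protein described above can be illustrated with a minimal Python sketch. The toy gene, its exon and intron boundaries, and the eight-codon table are assumptions chosen only for illustration; a real gene is far longer, the full genetic code has 64 codons, and post-translational steps such as folding and glycosylation are not modeled.

```python
# Toy illustration of transcription, intron removal, and translation.
# All sequences are hypothetical, not real genes.

# Partial codon table (standard genetic code; mRNA codon -> amino acid letter).
CODON_TABLE = {
    "AUG": "M",  # methionine (start codon)
    "UUU": "F",  # phenylalanine
    "GGC": "G",  # glycine
    "GCU": "A",  # alanine
    "AAA": "K",  # lysine
    "GAA": "E",  # glutamate
    "UGG": "W",  # tryptophan
    "UAA": "*",  # stop codon
}

def transcribe(coding_dna: str) -> str:
    """Return the RNA copy of the coding (sense) DNA strand."""
    return coding_dna.replace("T", "U")

def splice(pre_mrna: str, introns: list) -> str:
    """Remove introns given as (start, end) half-open index pairs."""
    pieces, pos = [], 0
    for start, end in sorted(introns):
        pieces.append(pre_mrna[pos:start])
        pos = end
    pieces.append(pre_mrna[pos:])
    return "".join(pieces)

def translate(mrna: str) -> str:
    """Read codons from position 0 until a stop codon is reached."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# exon 1 (ATG TTT GGC) + intron (GTAAGT) + exon 2 (GCT AAA TAA)
gene = "ATGTTTGGC" + "GTAAGT" + "GCTAAATAA"
pre_mrna = transcribe(gene)
mrna = splice(pre_mrna, [(9, 15)])
protein = translate(mrna)
print(mrna, protein)
```

The three functions mirror the three stages named in the text: DNA to RNA, removal of noncoding material, and RNA to an amino acid sequence.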

The steps in the process and the basic elements of 
recombinant DNA technology are outlined in Table 
1 and are described herein. Recombinant DNA 
technology starts with the step of gene isolation, 
which includes three methods. One method begins 
with protein analysis; the composition of the pro- 
tein must be identified fully with regard to the type 
and sequence of amino acids. Because the DNA 
codons (triplets) for each amino acid already are 
known, the DNA sequence then is synthesized in 
reverse from the protein. A second method involves
isolation of the m-RNA that is responsible for pro- 
ducing a specific protein. Then, the viral enzyme, 
reverse transcriptase, is used to create the comple- 
mentary DNA for the m-RNA. A third method is a 
screening process of the full genomic library when 
the protein cannot be identified fully.

TABLE 1. Steps in process and basic elements of
recombinant DNA technology

Step 1
Gene isolation (three alternatives with different starting material)
    From protein analysis to DNA triplets to DNA synthesis
    From m-RNA identification plus viral reverse transcriptase to c-DNA formation
    From chromosome pool with DNA probes to DNA screening for gene

Step 2
Cloning/expression (from genes to proteins in the laboratory)
    Plasmid and gene: starting DNA materials
    Restriction endonuclease enzyme for plasmid opening
    DNA ligase enzyme for plasmid closure with human gene
    Recombinant plasmid entry into host cell
    Host cell/plasmid cloning (duplication)
    Gene expression of protein by host cells

Step 3
Scale-up process (manufacturing of commercial product)
    Inoculum: daughter cells from master cell bank
    Cell culture: serial fermentation to crude protein mixture
    Purification: desired protein in bulk
    Formulation: final product

Step 4
Quality control measures (tests of products and process)
    Plasmid/host cell: integrity assessment
    Bulk product: structure, potency, purity, activity
    Process validation: efficiency and reproducibility
    Final product: as for bulk product, plus stability, sterility

Ther Drug Monit, Vol. 15, No. 6, 1993

R. P. EVENS AND M. WITCHER

From two
separate peptide segments of the protein, two dis- 
tinct oligonucleotide probes matching the two pep- 
tides are created. Next, the chromosome is dena- 
tured into several hundreds of thousands of DNA 
segments. This pool of DNA segments is searched
using each oligonucleotide probe. A DNA segment
that matches both probes would be a likely clone 
representing the target gene. 
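
The two-probe screen can be sketched in a few lines of Python. This is an illustration only: the library segments and probe sequences are hypothetical, and exact substring matching stands in for laboratory hybridization, which tolerates mismatches and involves far longer sequences.

```python
# Toy genomic-library screen using two oligonucleotide probes.
# Sequences are hypothetical; exact string matching is a stand-in
# for probe hybridization.

def matches_both(segment: str, probe_a: str, probe_b: str) -> bool:
    """True if the DNA segment contains both probe sequences."""
    return probe_a in segment and probe_b in segment

library = [
    "TTGACCGTAGGCTA",
    "ATGGCTAAAGGCTTTGAATGG",
    "CCATGGCTAAATTTCCC",
]
probe_a = "ATGGCTAAA"  # derived from one peptide segment (assumed)
probe_b = "TTTGAATGG"  # derived from a second peptide segment (assumed)

candidates = [s for s in library if matches_both(s, probe_a, probe_b)]
print(candidates)
```

Requiring both probes to match reduces false positives: the third segment contains the first probe but not the second, so it is rejected.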

The second step in product development for 
r-DNA technology is cloning and expression, which 
is worked out in the laboratory. Cloning is duplica- 
tion of a gene and its host cells. Expression is pro- 
duction of the protein from the gene in a host cell. 
The components of this step include the gene,
plasmids, restriction endonucleases, DNA ligases,
DNA linkers, DNA promoters/enhancers, and host
cells.

Plasmids are circular rings of DNA that serve as
the vector to carry the human gene into host cells.
Key properties of plasmids are that they are DNA
material, they have the capacity for autonomous
replication of DNA, they allow ready acceptance of
genes, they are transferable between cells, and they
are stable during the recombinant process. Their
source is bacteria, where they are responsible for
transference of virulence or resistance, e.g., the
pBR322 plasmid. Restriction endonuclease
enzymes are employed to cut open plasmids at
specific nucleotide triplets; a certain enzyme is chosen
to open the plasmid at sites compatible with the
terminal triplets of a gene. Bacteria again provide
this tool, with ~100-200 distinct enzymes available.

DNA ligase enzyme is used to recombine the 
DNA material, incorporating the human gene into 
the bacterial plasmid. If the gene does not permit 
full closure of the plasmid ring, a DNA linker (seg- 
ment of DNA) is added to piece together the gene 
with the plasmid ring. Sometimes a special DNA 
segment from a viral or bacterial source is added to 
the gene/plasmid DNA ring to increase gene repli- 
cation and protein production. This recombined 
plasmid is then transferred to a host cell. 
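
The opening and closure of a plasmid can be sketched as string operations. This is a simplified model under stated assumptions: the plasmid and gene sequences are hypothetical, only one DNA strand is represented, the circular plasmid is written as a linear string, and EcoRI (recognition sequence GAATTC, cut between G and A) is used as an example enzyme that the article itself does not name.

```python
# Toy cut-and-ligate model of recombinant plasmid construction.
# Sequences are hypothetical; only the top strand is modeled.

SITE = "GAATTC"   # EcoRI recognition sequence (example enzyme)
CUT_OFFSET = 1    # EcoRI cuts after the first G: G^AATTC

def cut_plasmid(plasmid: str, site: str = SITE, offset: int = CUT_OFFSET):
    """Open the plasmid at its single recognition site."""
    idx = plasmid.find(site)
    if idx == -1:
        raise ValueError("no recognition site in plasmid")
    if plasmid.find(site, idx + 1) != -1:
        raise ValueError("more than one site; plasmid would fragment")
    cut = idx + offset
    return plasmid[:cut], plasmid[cut:]

def ligate(left: str, insert: str, right: str) -> str:
    """Join the opened plasmid ends to the insert (DNA ligase step)."""
    return left + insert + right

plasmid = "CCGGTTAAGAATTCGGCCTTAA"
gene = "ATGTTTGGCTAA"
# An insert excised with the same enzyme carries matching ends.
gene_insert = "AATTC" + gene + "G"

left, right = cut_plasmid(plasmid)
recombinant = ligate(left, gene_insert, right)
print(recombinant)
```

Because the insert ends are compatible with the cut plasmid, the recognition site is regenerated at both junctions, which is why the same enzyme could later excise the gene again.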

Host cells need to possess three primary
characteristics: a short reproductive cycle, ease of growth
in vitro, and acceptability of plasmids. Host cells
can be fungal (e.g., yeast), bacterial (e.g., E. coli),
or mammalian (e.g., Chinese hamster ovary) cells.
The host cells with the recombined plasmids are
placed in special nutrient mixtures designed for
optimal cell viability and protein production. These
cells are cloned (duplicated) along with intracellular
plasmid multiplication. These host cells retain the
capacity to produce all the various proteins in their
normal life cycle and express one more protein,
which is the duplicate or functional equivalent of
the human protein produced by human gene
expression. The cloning process results in a master
working cell bank of special new parent cells capable of
producing human proteins.

The site of manufacture of proteins in r-DNA
technology is living cells. Scale-up of protein
production is required to manufacture sufficient
amounts of protein from very large cell colonies to
meet patients' needs in a reasonably cost-effective
manner. The manufacturing processes primarily are
twofold, based on the type of host cells, that is,
mammalian (eukaryote) or microbial (prokaryote)
cells. In mammalian cell culture, cells grow slowly,
with a doubling time of ~24-36 h. A production lot
can require more than a month to produce.
Prominent challenges are to maintain an environment that
remains sterile, keeps cells viable and productive,
and avoids any contamination over a 1-2 month
period. Mammalian host cells secrete the proteins
extracellularly, which enhances harvesting of proteins
from media. Chinese hamster ovary cells are the
mammalian cell line employed for epoetin alfa
(EPOGEN) and alteplase (Activase). For microbial
cell cultures, the cell doubling time can be as little
as 30 min, which is highly desirable and, hence, is
preferred for producing proteins. Microbial
fermentation is an easier process and is more frequently
used in r-DNA manufacturing. It is employed for
filgrastim (NEUPOGEN). Microbial cells do not
secrete proteins extracellularly and store proteins in
vesicles, which complicates harvesting of proteins.
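
The practical weight of the two doubling times can be seen with a short calculation. The sketch below assumes idealized, uninterrupted exponential growth and an arbitrary million-fold expansion target; both are assumptions for illustration, and real cultures slow as nutrients and space become limiting.

```python
import math

def hours_to_expand(fold: float, doubling_time_h: float) -> float:
    """Hours needed for a given fold expansion under ideal exponential growth."""
    return math.log2(fold) * doubling_time_h

TARGET_FOLD = 1_000_000  # hypothetical expansion target

mammalian_h = hours_to_expand(TARGET_FOLD, 24.0)  # 24 h doubling (lower bound in text)
microbial_h = hours_to_expand(TARGET_FOLD, 0.5)   # 30 min doubling

print(f"mammalian: {mammalian_h / 24:.1f} days")
print(f"microbial: {microbial_h:.1f} hours")
```

Under these assumptions the mammalian culture needs roughly 20 days and the microbial culture roughly 10 hours, a 48-fold difference that follows directly from the ratio of the doubling times.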

Four primary steps are involved in scale-up for
mammalian r-DNA manufacturing: inoculum, cell
culture, purification, and formulation. The
inoculum step entails obtaining a single vial of daughter
cells from the parent cells in the master working cell
bank. These daughter cells are grown in vitro in
small spinner flasks to initiate the manufacturing
scale-up. This initial cell culturing expands cell
volume with serially larger spinner flasks; cell density
in the flasks is the critical factor in maximizing cell
growth. The goal of this inoculum phase is to
prepare a sufficient number of cells for conduct of the
cell culture phase. Cell culture production is the
core process in the scale-up in r-DNA technology:
the proteins are produced by serial fermentation, in
which the amount of cloning (number of functioning
cells, volume of growth media, and protein
production) is increased substantially in a stepwise
fashion. This cell culture process can employ a large
fermenter wherein the cells float free in the
medium, that is, a suspension culture. Alternatively,
attachment technology, e.g., a fluidized bed
fermenter or roller bottles, is used, wherein cells are
fixed on a surface and are bathed in media.

The fermentation process requires careful control 
of the cell environment, including the nutrient me- 
dium, the air mixture (oxygen, nitrogen, carbon di- 
oxide, and moisture), the breathability of the fer- 
mentation flasks (between flasks and ambient envi- 
ronment), the temperature of flasks, and the 
removal of cellular waste. Media shifts in roller bot- 
tles are performed periodically to maintain cell vi- 
ability. Harvesting of protein from the medium is 
done at several times during the cell culture pro- 
cess, along with a step to concentrate the liquid 
mixture to create a crude multi-protein mixture. 
Throughout this fermentation period, the cells must 
be kept alive and functional without mutation, 
change, or contamination (e.g., viruses or onco- 
genes). 

Purification of the crude protein is accomplished 
with a sequence of chromatographic steps to sepa- 
rate the desired protein from the crude mixture. The 
physical properties of the proteins (such as the size 
of molecules, the hydrophobic properties, and
electrical charge) allow this separation to be done. The
common chromatographic methods include gel- 
filtration, ion-exchange, affinity, reversed-phase, 
and hydrophobic techniques. Sterile filtration is 
then performed, to obtain a purified bulk of target 
protein without contaminating material or organ- 
isms. The final step, formulation, requires mixing 
the protein with a diluent plus stabilizers and
buffers specific for that protein product. Filling and
labeling of ready-to-use vials is necessary. Some
biological products are lyophilized in an attempt to
improve a product's shelf life, although reconstitution
by practitioners then is required before use.

For microbial fermentation, the cell culture step 
uses one or several large fermenters containing the 
appropriate medium. Because of the high cell- 
growth rate, the fermenter has the capacity to force 
air into the media to maintain cell viability. At the 
end of fermentation, the cells can be killed without 
denaturing the proteins that are stored in intracel- 
lular inclusion bodies. The contents of the fer- 
menter often are centrifuged to create a cell paste. 
Recovery of proteins from this microbial system 
usually calls for extraction of the inclusion bodies 
from the dead cells. The cells are broken up with a
homogenizer or other high-shear device to obtain
the inclusion bodies. A high-shear device also is
employed to process the inclusion bodies and
remove the bulk protein. The next step is purification
and formulation similar to what we have described
herein.

Quality control is a substantial challenge in the 
biotechnology industry, with >70 tests employed to 
ensure the integrity, quality, and activity of the in- 
termediates as well as the final product. Biological 
products usually are proteins; as such, they are 
more complex compounds, have a more detailed 
and sophisticated manufacturing process, and are 
made in living cells, in contrast to the chemical syn- 
thesis and/or extraction processes used for drugs. 
These three features lead to a potentially greater
opportunity for contamination, alteration, or dena- 
turation of the protein product. 

Four areas are addressed by quality control, that 
is, plasmids and host cells, bulk product, process 
validation, and final product batches. First, a guar- 
antee of the integrity of host cells and plasmids ne- 
cessitates, for example, chromosomal analysis of 
karyotype, contamination screens for viruses and
oncogenes, and gene stability assessment. Second, 
testing of bulk product is extensive for a protein 
encompassing, for example, high-pressure liquid 
chromatographic analysis, radioimmunoassay, 
Western and Southern blot analyses, amino acid se- 
quencing, peptide maps, and even bioassay. The 
goals of such a large number of sophisticated tests 
are to ensure structure, purity, potency, and activity
of the protein. Third, final product is evaluated
with the same tests as for bulk product, along with
standard assessments of final formulation (such as 
tests of sterility, excipients, volume, appearance, 
and stability). Fourth, process validation tests are 
intended to address the efficiency and reliability of 
the manufacturing process. 

TECHNIQUES OF BIOTECHNOLOGY 

Although recombinant DNA technology is
responsible for >90% of the available biological
products, biotechnology has evolved to create several
key technologies for product development, as
represented in Table 2. Monoclonal antibody (MAb)
technology has been in use for the past 20 years and
has led to many diagnostic advances, e.g., ELISA
(enzyme-linked immunosorbent assay) test kits for
drug testing and OncoScint CR103, radiolabeled
monoclonal antibodies for diagnosis of colorectal
cancer. Therapeutic applications are limited at this
time to one product, antirejection (kidney) therapy
with muromonab, but many products are in
development, especially for cancer therapy. Manufacture
of a MAb requires creation of a hybridoma from
mouse B cells, which produce specific antibodies
against a target antigen, and from myeloma cells,
which have long lives and produce large amounts of
antibodies (7). The hybridoma possesses properties
of both cell components. It is cloned and grown to
yield antibodies, which are then harvested and
purified.

TABLE 2. Techniques of biotechnology

Recombinant DNA technology
Monoclonal antibody technique
Polymerase chain reaction
Gene therapy
Nucleotide blockade
Peptide technology
Carbohydrate technology

The polymerase chain reaction is a biotechnology 
whereby substantial amplification (over several 
100,000-fold) of a target nucleic acid sequence (e.g., 
a gene) is obtained (8). This enzymic reaction hap- 
pens in repeated cycles of a three-step process: (a) 
DNA is denatured to separate the two strands; (b) a 
nucleic acid primer is hybridized to each DNA 
strand at a specific location in the nucleic acid se- 
quence; and (c) DNA polymerase enzyme is added 
for extension of the primer along the DNA strand to 
copy the target nucleic acid sequence. Each cycle 
doubles the DNA molecules copied, and the cycle is 
repeated until sufficient DNA sequence material is 
copied, e.g., 20 cycles with a 90% success rate
yield a ~375,000-fold amplification of a DNA sequence.
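
The amplification figure can be checked with a one-line model. The sketch assumes that "success rate" means the fraction of template molecules copied in each cycle, so that each cycle multiplies the number of molecules by (1 + efficiency); this interpretation is an assumption, although it reproduces the number quoted in the text.

```python
def pcr_amplification(cycles: int, efficiency: float) -> float:
    """Fold amplification after a number of cycles.

    efficiency is the fraction of templates copied per cycle
    (1.0 = perfect doubling); an assumed reading of "success rate".
    """
    return (1.0 + efficiency) ** cycles

ideal = pcr_amplification(20, 1.0)     # perfect doubling every cycle
realistic = pcr_amplification(20, 0.9) # 90% of templates copied per cycle

print(f"ideal: {ideal:,.0f}-fold, at 90% efficiency: {realistic:,.0f}-fold")
```

Perfect doubling over 20 cycles would give about a million-fold amplification; at 90% per-cycle efficiency the yield falls to roughly 376,000-fold, which shows how strongly small per-cycle losses compound.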

Gene therapy is employed in inheritable diseases
where a protein is either dysfunctional or not
produced because of an abnormal gene (9). These
common and life-threatening diseases include cystic
fibrosis, hemophilia, sickle cell anemia, and diabetes.
The gene responsible for the malady must be
identified first, which was done, for example, for ADA
(adenosine deaminase) enzyme deficiency. This
disease leads to severe combined immunodeficiency
(SCID), often causing death in childhood or
adolescence. The normal human gene for the fully
functional protein is cloned and inserted into a
carrier, such as a harmless virus. A patient's cells,
e.g., T lymphocytes, are grown in the laboratory,
and the cells receive the gene from the viral carrier.
The patient's cells start producing the missing
protein to correct the deficiency. These cells with the
extra functional gene then are returned to the
patient, and the normal protein is produced and
released, alleviating the disease. For SCID and other
diseases, gene therapy is life-saving.

Nucleotide blockade or antisense focuses on ar- 
resting expression of dysfunctional messenger RNA 
or DNA (10). A complementary m-RNA is created 
to match an abnormal m-RNA. The two m-RNA 
strands complex together, preventing translation of 
the m-RNA to form disease-producing proteins. 
Anti-DNA strands also can be created to complex 
with DNA to form a triple helix. Oligonucleotides,
or short single strands of nucleic acids, instead of 
the full m-RNA also can be employed to block RNA 
expression. Viral disease (herpes simplex and HIV) 
and cancer (oncogenes) are two targets of this form 
of biotechnology. 
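
The pairing principle behind antisense blockade can be shown with a short sketch. The target sequence is hypothetical, and the model covers only Watson-Crick complementarity between two RNA strands; it says nothing about delivery, stability, or triple-helix formation with DNA.

```python
# Toy model of antisense design: the antisense strand is the
# reverse complement of the target m-RNA, so the two strands can
# pair base by base in antiparallel orientation.

RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def antisense(mrna: str) -> str:
    """Return the reverse complement of an RNA sequence (5'->3')."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(mrna))

def can_pair(strand_a: str, strand_b: str) -> bool:
    """True if the two strands are fully complementary."""
    return antisense(strand_a) == strand_b

target = "AUGGCUAAAUUU"   # hypothetical disease-related m-RNA fragment
oligo = antisense(target)

print(oligo, can_pair(target, oligo))
```

Taking the reverse complement twice returns the original sequence, which is a quick sanity check on any antisense design routine.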

Peptide technology entails screening for polypeptide
molecules that can mimic large proteins, thus
affording more simple products that may be more 
stable and easier to produce. The peptides can 
serve as protein receptor agonists or antagonists. 
Carbohydrate chemistry is targeted at inflammatory 
and immune diseases involving leukocyte activa- 
tion. Expression of carbohydrate molecules from 
cell surfaces (cell adhesion molecules) can be pre- 
vented, arresting cell migration or activation and, in 
turn, moderating the leukocyte-related components 
of inflammation. 

These brief descriptions of biotechnology tech- 
niques now employed in product development ex- 
pose the vast potential for many and varied biolog- 
ical products, which will modify or correct many 
diseases. The biological products now in develop- 
ment in clinical trials well exceed 100 molecules. 

BIOLOGICAL PRODUCT AVAILABILITY

Fifteen biological molecules have been developed
and marketed in the years from 1982 through
mid-1993. Fourteen companies are marketing 20
products. Table 3 lists the molecules, products,
companies, indications, and year of availability. Some
products are used for single indications [such as
insulin for diabetes (1982) or Factor VIII for
hemophilia (1992)]. However, many products are
approved first for one major indication and then, after
substantial clinical research into additional
indications, are approved for other uses at a later time.
Examples include interferon α-2b (Intron A) for
hairy cell leukemia (1986), AIDS-related Kaposi's
sarcoma (1988), hepatitis C (1991), and hepatitis B
(1992), and epoetin α (EPOGEN, Procrit) for
anemia stemming from chronic renal disease (1989),
from AIDS-related antiviral therapy (1991), and
from cancer chemotherapy (1993). The disease
entities treated by these 15 molecules are
commonplace and address most organ systems and 18
separate conditions. They can be used in
endocrinology (diabetes, growth), hepatitis (types B and C),
oncology (hairy cell leukemia, AIDS-related
Kaposi's sarcoma, renal cell carcinoma, neutropenia,
and infections from chemotherapy), anemias,
cardiology (heart attacks, pulmonary embolism,
hemophilia), neurology (multiple sclerosis), and organ
(kidney) rejection. This brief description establishes
the profound impact of biotechnology on health
care, bringing therapeutic agents to treat diseases
that often were previously untreatable. A new arm
of therapy, biotherapy, has been created.

AN INTRODUCTION TO BIOTECHNOLOGY

TABLE 3. Biological products^a

Generic name | Trade name (company) | Therapeutic area (year of approval)
Human insulin | Humulin (Lilly) | Insulin-dependent diabetes mellitus (1982)
Human growth hormone | Protropin (Genentech); Humatrope (Lilly) | Human growth hormone deficiency in children (1985)
Hepatitis B vaccine | Engerix-B (SmithKline Beecham); Recombivax HB (MSD) | Hepatitis B prophylaxis (1986)
Interferon α-2a | Roferon A (Hoffmann-La Roche) | Hairy cell leukemia (1986); AIDS-related Kaposi's sarcoma (1988)
Interferon α-2b | Intron A (Schering-Plough) | Hairy cell leukemia (1986); AIDS-related Kaposi's sarcoma (1988); chronic hepatitis types B (1992) and C (non-A, non-B) (1991); condylomata acuminata (1988)
Muromonab-CD3 | Orthoclone OKT 3 (Ortho Biotech) | Acute allograft rejection in renal transplant patients (1986)
Alteplase | Activase (Genentech) | Acute myocardial infarction (1987); pulmonary embolism (1990)
Epoetin α | Epogen (Amgen); Procrit (Ortho Biotech) | Certain anemias: chronic renal disease (1989); AIDS (1991); cancer chemotherapy (1993)
Interferon α-n3 | Alferon N (Interferon Sciences) | Condylomata acuminata (1989)
Interferon γ-1b | Actimmune (Genentech) | Chronic granulomatous disease (1990)
Filgrastim (G-CSF) | Neupogen (Amgen) | Febrile neutropenia (infection) due to myelosuppressive chemotherapy (1991)
Sargramostim (GM-CSF) | Leukine (Immunex); Prokine (Hoechst-Roussel) | Myeloid reconstitution after bone marrow transplantation (1991)
Aldesleukin | Proleukin (Chiron) | Metastatic renal cell carcinoma (1992)
Factor VIII | KoGenate (Miles); Recombinate (Baxter) | Hemophilia A (1992)
Interferon β | Betaseron (Chiron, Berlex) | Multiple sclerosis (1993)

G-CSF, granulocyte colony-stimulating factor; GM-CSF, granulocyte-macrophage
colony-stimulating factor.

^a List of products is given in order of date of first product approval.



FUTURE BIOTECHNOLOGY PRODUCTS 

The first decade of biotechnology drug
development produced an admirable record of one new
biological molecule per year, on average. This early
success is expected to be followed by a host of
products in similar and new therapeutic areas.
According to 1991 Pharmaceutical Manufacturers
Association (PMA) data, 132 biological products were
in clinical development and 21 molecules were at
the Food and Drug Administration (FDA) for
regulatory review and potential approval (11). The time
frame to bring such new biological products to
market (patient availability) has been documented to
average ~4.8 years in the United States and 3 years
in Europe; two-thirds of this time is consumed by
clinical research and one-third by regulatory review
(12). According to PMA data, from 1982 to 1992,
FDA review of product license applications for
biological products averaged 24 months, and new
indications took 30 months for review and approval.
Over the past 5 years, the research pipeline has
grown from <80 to >130 products (60% growth),
and the expected time required for product research
and approval is ~5 years.

These new biotherapies incorporate both
extensions of current product categories and whole new
entities. Table 4 cites 10 product categories for
future biological products and includes >150 new
products or indications that are in development
(13). Most therapeutic areas are being targeted by
biotechnology, but oncology has the largest number
of products; some two-thirds to three-quarters of
interferons, interleukins, and monoclonal
antibodies are used in this area. In vaccines, AIDS is the
major focus. Tissue-growth factors, including
epidermal growth factor, fibroblast growth factor, and
platelet-derived growth factor, address repair of
wounds due to injury, surgery, chronic disease, or
decubiti. Clotting factors include proteins to
improve clotting and to dissolve clots. The last
category (labeled "Various") includes such molecules
as dismutases, tumor necrosis factor, and soluble
CD4.



TABLE 4. Future biotherapies

Category | Number of indications and products
Antisense | 5
Clotting factors | 8
Hematopoietic factors | 12
Hormones | 7
Interferons | 16
Interleukins | 13
Monoclonal antibodies | 58
Tissue growth factors | 11
Vaccines | 18
Various | >15

See Burrill and Lee for summary (13).



Biological products have become the mainstay of
therapy for major diseases already benefiting
millions of patients each year, with only 15 molecules
and 20 products in use to date. Future products and
indications exceed 100 in number. The full impact
of these therapies and the benefits to patients still
are being discovered. Biotherapy holds the promise
of a natural therapy, wherein proteins, nucleotides,
and other human molecules are recreated and
employed to ameliorate or even cure disease. We are
on the threshold of a pharmacologic and therapeutic
revolution.
Mapping and mutational analysis of the IgE-binding epitopes on Ara h 1,

Peanut allergy is a significant health problem because of the prevalence and potential severity of the 
allergic reaction. Serum IgE from patients with documented peanut hypersensitivity reactions and overlap- 
ping peptides were used to identify the IgE-binding epitopes on the major peanut allergen, Ara h 1. At 
least twenty-three different linear IgE-binding epitopes, located throughout the length of the Ara h 1 
protein, were identified. All of the epitopes were 6-10 amino acids in length, but there was no obvious 
sequence motif shared by all peptides. Four of the peptides appeared to be immunodominant IgE-binding 
epitopes in that they were recognized by serum from more than 80% of the patients tested and bound 
more IgE than any of the other Ara h 1 epitopes. Mutational analysis of the immunodominant epitopes 
revealed that single amino acid changes within these peptides had dramatic effects on IgE-binding
characteristics. The identification and determination of the IgE-binding capabilities of core amino acids in 
epitopes on the Ara h 1 protein will make it possible to address the pathophysiologic and immunologic 
mechanisms regarding peanut hypersensitivity reactions specifically and food hypersensitivity in general. 

Keywords: food allergy; IgE-binding epitope; epitope analysis. 
Approximately 8% of children and 1-2% of adults have 
some type of food allergy [1]. Peanuts, fish, tree nuts, and shell- 
fish account for the majority of food hypersensitivity reactions 
in adults, while peanuts, milk, and eggs cause over 80% of food 
hypersensitivity reactions in children [2]. 

Food hypersensitivity reactions occur shortly after contact of 
a specific allergen with its corresponding IgE antibodies which 
are bound to mast cells. Allergen-specific IgE, when cross- 
linked by the respective allergen, activates the mast cells to re- 
lease histamine, heparin, and other mediators responsible for the 
clinical symptoms observed. Thus the IgE-binding epitopes of 
the allergens play an important role in the disease process. Their 
characterization will provide a better understanding of the hu- 
man immune response involved in food hypersensitivity reac- 
tions. If improved diagnostic and therapeutic capabilities are to 
be developed it is important to determine the primary structure 
and frequency of recognition of any IgE-binding epitopes con- 
tained within the allergen. 

Various studies have shown that the most allergenic portion 
of the peanut is the protein fraction of the cotyledon [3]. A major 
allergen found in the cotyledon is the peanut protein, Ara h 1 
[4]. The Ara h 1 allergen belongs to the vicilin family of seed 
storage proteins [5]. This protein is recognized by greater than 
90% of peanut-sensitive patients, thus establishing it as an im- 
portant allergen [4]. The majority of serum IgE recognition of 
the Ara h 1 allergen appears to be due to epitopes within this 
protein that are linear amino acid sequences that do not contain 
significant amounts of carbohydrate [4, 5]. Previous results have 
demonstrated similarity between the level of IgE binding to
recombinant Ara h 1 protein and the native form of this allergen 
when individual patient serum was tested [5]. These results indi- 
cated that the recombinant protein could be considered for use 
in both diagnostic and immunotherapeutic approaches to peanut 
hypersensitivity. 

Because of the prevalence and severity of peanut hypersensitivity
reactions in both children and adults, and the role B-cell 
epitopes play in these reactions, we set out to map and character- 
ize the major IgE epitopes of the Ara h 1 allergen. In this study, 
we report the primary structure of the Ara h 1 IgE-binding epi- 
topes recognized by peanut hypersensitive individuals. Four epi- 
topes that bound peanut-specific serum IgE from more than 80% 
of patients tested were identified. The amino acids important 
to peanut-specific IgE recognition of these epitopes were then 
determined for the purpose of using them in future diagnostic 
and immunotherapeutic approaches to this disease. 



MATERIALS AND METHODS 

Serum IgE. Serum from 15 patients with documented pea- 
nut hypersensitivity reactions (mean age, 25 y) was used to iden- 
tify the Ara h 1 IgE-binding epitopes. Each of these individuals 
had a positive immediate prick skin test to peanut and either 
a positive double-blind placebo-controlled food challenge or a 
convincing history of peanut anaphylaxis (laryngeal edema, se- 
vere wheezing, and/or hypotension). Representative individuals 
with elevated serum IgE levels (who did not have peanut-specific
IgE or peanut hypersensitivity) were used as controls in
these studies. In some instances, a serum pool was made by 
mixing equal aliquots of serum IgE from each of the 15 patients 
with peanut hypersensitivity. This pool was then used in immunoblot
analysis experiments to determine the IgE-binding characteristics
of the population. At least 5 ml venous blood was
drawn from each patient and allowed to clot, and the serum
collected. All studies were approved by the Human Use Advisory
Committee at the University of Arkansas for Medical Sciences.

Fig. 1. There are multiple predicted antigenic sites on the Ara h 1 allergen. The amino acid sequence of the Ara h 1 protein was analyzed for
potential antigenic sites by the Jameson and Wolf (1988) algorithm. There were 11 (1-11) predicted regions that contained multiple antigenic sites
(octagons) along the entire length of the molecule. The amino acid sequence is represented as α-helical (sinusoidal curve), β-sheet (saw tooth curve),
and coil (flat sinusoidal curve) conformations. β turns are denoted by chain reversals.

Computer analysis of Ara h 1 sequence. Analysis of the 
Ara h 1 gene [5] and peptide sequences was performed on the 
University of Arkansas for Medical Sciences' Vax computer 
using the Wisconsin DNA analysis software package. The pre- 
dicted antigenic regions on the Ara h 1 protein are based on 
algorithms developed by Jameson and Wolf [6] that relate antigenicity
to hydrophilicity, secondary structure, flexibility, and
surface probability.

Peptide synthesis. Individual peptides were synthesized on 
a cellulose membrane containing free hydroxyl groups using 
Fmoc-amino acids according to the manufacturer's instructions 
(Genosys Biotechnologies). Synthesis of each peptide was 
started by esterification of an Fmoc-amino acid to the cellulose 
membrane. After washing, all residual amino functions on the 
sheet were blocked by acetylation to render them unreactive during
the subsequent steps. Fmoc protective groups were then removed
by addition of piperidine to render nascent peptides reactive.
Each additional Fmoc-amino acid was esterified to the previous
one by this same process. After addition of the last amino
acid in the peptide, the amino acid side chains were deprotected
using a mixture of 1:1:0.05 (by vol.) dichloromethane/trifluoroacetic
acid/triisobutylsilane, followed by washing with dichloromethane
and methanol. Membranes containing synthesized peptides
were either probed immediately with serum IgE or stored
at -20°C until needed.

IgE-binding assay. Cellulose membranes containing synthe- 
sized peptides were incubated with the serum pool or individual 
serum from patients with peanut hypersensitivity diluted (1:5) 
in a solution containing Tris/NaCl (10 mM Tris/HCl, 500 mM
NaCl, pH 7.5) and 1% bovine serum albumin for at least 12 h
at 4°C or 2 h at room temperature. The primary antibody was
detected with 125I-labeled anti-IgE antibody (Sanofi Pasteur
Diagnostics).



RESULTS 

There are multiple IgE-binding regions throughout the Ara 
h 1 protein. The Ara h 1 amino acid sequence was analyzed for 
potential antigenic epitopes using computer-based algorithms.
There were 11 possible antigenic regions, each containing
multiple antigenic sites, predicted by this analysis along the
entire length of the molecule (Fig. 1).

Seventy-seven overlapping peptides representing the entire length of the
Ara h 1 protein were synthesized to determine if the predicted
antigenic regions, or any other regions, were recognized by serum
IgE. Each peptide was 15 amino acids long and offset from
the previous peptide by eight amino acids. In this manner, the
entire length of the Ara h 1 protein could be studied in large
overlapping fragments. These peptides were then probed with a
pool of serum IgE from 15 patients with documented peanut
hypersensitivity or with serum IgE from a representative control
patient with no food allergy. Serum IgE from the control patients
did not recognize any of the synthesized peptides. In contrast,
there were 12 IgE-binding regions (D1-D12) along the entire
length of the Ara h 1 protein recognized by IgE from this population
of patients with peanut hypersensitivity (Fig. 2). These
IgE-binding regions represent amino acid residues 35-72, 89-112,
121-176, 289-326, 337-350, 361-374, 393-416, 457-471,
489-513, 521-535, 544-583, and 593-607. In general,
the predicted antigenic regions (Fig. 2, boxed areas P1-P11)
contained or were part of those that were determined (Fig. 2,
shaded areas D1-D12) by actual IgE binding. However, there
were two predicted antigenic regions (amino acids 221-230;
amino acids 263-278) that were not recognized by serum IgE
from peanut hypersensitive individuals. In addition, there were
numerous IgE-binding regions found in the Ara h 1 protein between
amino acids 450-600 (Fig. 2).

Fig. 2. Multiple IgE-binding regions identified on the Ara h 1 allergen.
(A) Epitope mapping was performed on the Ara h 1 allergen by
synthesizing the entire protein in 15-amino-acid-long overlapping peptides
that were offset from each other by eight amino acids. These peptides
were then probed with a pool of serum IgE from 15 patients with
documented peanut hypersensitivity. The position of the peptides within
the Ara h 1 protein are indicated on the left-hand side of A. (B) The
amino acid sequence of the Ara h 1 protein is shown. The numbered
boxes correspond to the predicted antigenic regions (P1-P11). The
shaded areas (D1-D12) correspond to the IgE-binding regions shown in A.
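The overlapping-peptide scheme described above (15-residue peptides offset by eight residues) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `overlapping_peptides` and the toy sequence are hypothetical, and the sketch simply stops at the last full-length window, which may or may not match how the C-terminal end was handled in the study.

```python
def overlapping_peptides(sequence, length=15, offset=8):
    """Return (start, end, peptide) tuples using 1-based inclusive positions.

    Windows of `length` residues are taken every `offset` residues.
    Residues beyond the last full-length window are not covered
    (an assumption of this sketch, not a statement about the study).
    """
    if length <= 0 or offset <= 0:
        raise ValueError("length and offset must be positive")
    peptides = []
    start = 0
    while start + length <= len(sequence):
        peptides.append((start + 1, start + length, sequence[start:start + length]))
        start += offset
    return peptides


if __name__ == "__main__":
    # Hypothetical 31-residue toy sequence, not the Ara h 1 sequence.
    toy = "ACDEFGHIKLMNPQRSTVWYACDEFGHIKLM"
    for first, last, pep in overlapping_peptides(toy):
        print(f"{first:>3}-{last:<3} {pep}")
```

The same helper covers the finer mapping step by calling it with `length=10, offset=2` on the sequence of a single IgE-binding region.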



To determine the amino acid sequence of the IgE-binding
sites, small overlapping peptides spanning each of the larger
IgE-binding regions identified in Fig. 2 were synthesized. By
synthesizing smaller peptides (10 amino acids long) that were
offset from each other by only two amino acids, it was possible
to identify individual IgE-binding epitopes within the larger
IgE-binding regions of the Ara h 1 molecule. A representative
immunoblot and the respective amino acid sequence of the binding
region D2-D3 (amino acids 82-133) is shown in Fig. 3. Four
epitopes (Fig. 3, 4-7) were identified in this region. Similar
blots were completed for the remaining IgE-binding regions to
identify the core amino acid sequences for each IgE epitope.
Table 1 summarizes the 23 IgE-binding epitopes (peptides 1-23)
and their respective positions in the Ara h 1 molecule. The
most common amino acids found were acidic (D, E) and basic
(K, R) residues comprising 40% of all amino acids found in the
epitopes. There were no obvious amino acid sequence motifs
shared by all the IgE-binding epitopes.
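The share of acidic and basic residues in a set of epitope peptides can be computed as in the minimal sketch below. The function name `charged_fraction` is hypothetical, and the example uses only three of the peptides listed in Table 1 (peptides 1, 4, and 17), so its result illustrates the calculation and is not a reproduction of the figure reported for all 23 epitopes.

```python
# One-letter codes for acidic (D, E) and basic (K, R) residues.
CHARGED = set("DEKR")


def charged_fraction(peptides):
    """Fraction of all residues in `peptides` that are D, E, K, or R."""
    total = sum(len(p) for p in peptides)
    if total == 0:
        raise ValueError("no residues supplied")
    charged = sum(1 for p in peptides for aa in p.upper() if aa in CHARGED)
    return charged / total


if __name__ == "__main__":
    # Peptides 1, 4, and 17 from Table 1; a subset chosen for illustration.
    subset = ["AKSSPYQKKT", "GERTRGRQPG", "RRYTARLKEG"]
    print(f"{100 * charged_fraction(subset):.0f}% charged residues")
```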

Identification of common Ara h 1 epitopes recognized by serum
IgE from patients with peanut hypersensitivity. In an

Fig. 3. Multiple IgE-binding epitopes identified on the Ara h 1 allergen.
(A) Detailed epitope mapping was performed on IgE-binding regions
identified in Fig. 2 by synthesizing ten-amino-acid-long peptides offset
from each other by two amino acids. These peptides were then probed
with a pool of serum IgE from 15 patients with documented peanut
hypersensitivity. The data shown represents regions D2 and a portion of
D3 encompassing amino acid residues 82-133. Numbers correspond to
peptides as shown in Table 1. (B) The amino acid sequence (residues
82-133) of Ara h 1 that was tested in A is shown. Shaded areas of boxes
correspond to common IgE-binding amino acids of the spots shown in A.

Table 1. Ara h 1 IgE-binding epitopes. The underlined portions of each
peptide are the IgE-binding sequences determined by the
analysis described in Fig. 3.

Peptide   Amino acid sequence   Ara h 1 positions
 1        AKSSPYQKKT            25-34
 2        QEPDDLKQKA            48-57
 3        LEYDPRLVYD            65-74
 4        GERTRGRQPG            89-98
 5        PGDYDDDRRQ            97-105
 6        PRREEGGRWG            107-116
 7        REREEDWRQP            123-132
 8        EDWRRPSHQQ            134-143
 9        QPRKIRPEGR            143-152
10        TPGQFEDFFP            294-303
11        SYLQEFSRNT            311-320
12        FNAEFNEIRR            325-334
13        EQEERGQRRW            344-353
14        DITNPINLRE            393-402
15        NNFGKLFEVK            409-418
16        GTGNLELVAV            461-470
17        RRYTARLKEG            498-507
18        ELHLLGFGIN            525-534
19        HRIFLAGDKD            539-548
20        IDQIEKQAKD            551-560
21        KDLAFPGSGE
22        KESHFVSARP            578-587
23        PEKESPEKED            597-606

Fig. 4. Ara h 1 epitopes recognized by serum IgE from a patient with
peanut hypersensitivity. IgE-binding peptides (1-23) were synthesized
as ten-amino-acid peptides and probed with serum IgE from a patient
with documented peanut hypersensitivity.



effort to determine which, if any, of the 23 epitopes was immunodominant,
each set of 23 peptides was probed individually
with serum IgE from ten different patients. An epitope can be 
considered immunodominant if it is recognized by serum IgE 
from the majority of patients with peanut hypersensitivity or if 
the serum IgE that recognizes a peptide represents the majority 
of Ara h 1 -specific IgE found in a patient. Serum from five 
individuals randomly selected from the 15 patient serum pool 
and an additional five sera from peanut-hypersensitive patients 
not represented in the serum pool were used to identify the com- 
monly recognized epitopes. An immunoblot strip containing 
these peptides that has been incubated with an individual patient's
serum is shown in Fig. 4. This patient's serum IgE recognized
peptides 1, 4, 5, 8, 13, 15, and 17. Peptides 2, 3, 6, and 9
were detected but could only be minor epitopes considering the
amount of IgE that they bound. The remaining patients' serum
IgE were tested in the same manner and the intensity of IgE
binding to each spot was determined as a function of that patient's
total IgE binding to these 23 epitopes (Fig. 5). All of the
patient sera tested (10/10) recognized multiple peptides. The
most commonly recognized peptides were those that contained
epitopes 1, 3, 4, 13, 17, and 22. These epitopes were recognized
by IgE from at least 80% of the patient sera tested (8/10). In
addition, epitopes 1-4, 8, 12, and 17, when recognized, bound
more serum IgE from individual patients than any of the other
epitopes. These results indicate that peptides 1, 3, 4, and 17
contain the immunodominant epitopes of the Ara h 1 protein.

Fig. 5. Identification of the immunodominant Ara h 1 epitopes. IgE-binding peptides (1-23) were synthesized as described in Fig. 4 and probed
with serum IgE from 10 patients (A-J) with peanut hypersensitivity. Serum from five individuals randomly selected from the 15-patient serum
pool and an additional five sera from peanut-hypersensitive patients not represented in the serum pool were used to identify the immunodominant
epitopes. The relative intensity of IgE binding to each peptide was then determined by densitometry and expressed as a percentage of the patient's
total IgE binding to all of the Ara h 1 peptides.

Fig. 6. Single amino acid changes to peptide 17 result in loss of IgE
binding to this epitope. Peptide 17 was synthesized with an alanine
residue substituted for one of the amino acids in this peptide and probed
with a pool of serum IgE from 15 patients with documented peanut
hypersensitivity. The letters across the top of each panel indicate the
one-letter amino acid code for the residue normally at that position and
the amino acid that was substituted for this residue. The numbers indicate
the position of each residue in the Ara h 1 protein. WT indicates
the wild-type peptide (no amino acid substitutions).
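The two criteria used above, binding expressed as a percentage of each patient's total and recognition by at least 80% of sera, can be sketched as follows. All names and all intensity values here are hypothetical; the sketch assumes that any densitometry value above a chosen threshold counts as recognition, which is an illustrative simplification and not the scoring rule stated in the text.

```python
def percent_of_total(intensities):
    """Express each peptide's signal as a percentage of one patient's total."""
    total = sum(intensities.values())
    if total <= 0:
        return {pep: 0.0 for pep in intensities}
    return {pep: 100.0 * val / total for pep, val in intensities.items()}


def recognition_frequency(patients, threshold=0.0):
    """Fraction of patients whose signal for each peptide exceeds `threshold`."""
    if not patients:
        return {}
    counts = {}
    for spots in patients.values():
        for pep, val in spots.items():
            counts.setdefault(pep, 0)
            if val > threshold:
                counts[pep] += 1
    return {pep: n / len(patients) for pep, n in counts.items()}


def commonly_recognized(patients, min_fraction=0.8, threshold=0.0):
    """Peptides recognized by at least `min_fraction` of the patient sera."""
    freq = recognition_frequency(patients, threshold)
    return sorted(pep for pep, f in freq.items() if f >= min_fraction)


if __name__ == "__main__":
    # Hypothetical densitometry readings: patient -> {peptide number: signal}.
    readings = {
        "A": {1: 50, 2: 0, 17: 50},
        "B": {1: 10, 2: 5, 17: 85},
        "C": {1: 0, 2: 0, 17: 20},
        "D": {1: 30, 2: 0, 17: 70},
        "E": {1: 5, 2: 0, 17: 15},
    }
    print(percent_of_total(readings["A"]))
    print(commonly_recognized(readings))
```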



IgE-binding characteristics of the immunodominant Ara h 1
epitopes. The amino acids essential to IgE binding in epitopes
1, 3, 4, and 17 were determined by synthesizing duplicate peptides
with single amino acid changes at each position. These
peptides were then probed with pooled serum IgE from 15 patients
with peanut hypersensitivity to determine if the change
affected peanut-specific IgE binding. An immunoblot strip containing
the wild-type and mutated peptides of epitope 1 is shown
in Fig. 6. The pooled serum IgE did not recognize this peptide,
or binding was drastically reduced, when alanine was substituted
for each of the amino acids at positions 28-30 or 32. In contrast,
the substitution of an alanine for the glutamine residue at position
31 resulted in increased IgE binding. The remaining immunodominant
Ara h 1 epitopes were tested in the same manner
and the intensity of IgE binding to each spot was determined as
a percentage of IgE binding to the wild-type peptide (Fig. 7). In
general, each epitope could be mutated to a non-IgE-binding
peptide by the substitution of an alanine for a single amino acid
residue. There was no obvious position within each peptide that,
when mutated, would result in loss of IgE binding. Furthermore,
there was no consensus in the type of amino acid that, when
changed to alanine, would lead to loss of IgE binding.
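The single-substitution panel described above can be enumerated with a short sketch. The function name `alanine_scan` is hypothetical. The sketch assumes that a residue that is already alanine is replaced by glycine; the Fig. 7 caption mentions alanine or glycine substitutions, but the rule for choosing between them is an assumption here.

```python
def alanine_scan(peptide, first_position, substitute="A", fallback="G"):
    """List (label, mutant) pairs with one residue changed per mutant.

    `first_position` is the protein position of the peptide's first residue.
    Residues equal to `substitute` are changed to `fallback` instead
    (an assumption of this sketch).
    """
    variants = []
    for i, wild_type in enumerate(peptide):
        new = fallback if wild_type == substitute else substitute
        mutant = peptide[:i] + new + peptide[i + 1:]
        label = f"{wild_type}{first_position + i}{new}"
        variants.append((label, mutant))
    return variants


if __name__ == "__main__":
    # Peptide 1 from Table 1 (Ara h 1 positions 25-34).
    for label, mutant in alanine_scan("AKSSPYQKKT", 25):
        print(f"{label:<6} {mutant}")
```

Each mutant's densitometry signal would then be divided by the wild-type signal to give the percentage plotted in Fig. 7.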



DISCUSSION 

The development of an IgE response to an allergen involves
a series of interactions between T cells and B cells. B cells bearing
appropriate antigen-specific surface immunoglobulins interact
with proliferating allergen-specific T cells, which leads to
isotype switching and the generation of antigen-specific IgE.
The antigen-specific IgE then binds to the FcεRI receptors of
mast cells and basophils. This establishes a condition in which
the IgE-bound mast cells can trigger a potentially life-threatening
hypersensitive reaction when next exposed to the allergen.
Because antigen-specific IgE plays such a critical role
in allergic disease, determination of allergen-specific
IgE-binding epitopes is an important first step toward a
better understanding of this complex disease process.

In the present study, we have determined that there were
multiple antigenic sites predicted for the Ara h 1 allergen. In
general, as has been found with other allergens [7, 8], there was
agreement between those residues predicted by computer
analysis and B-cell epitopes determined by experimental analysis
of overlapping peptides. This correlation between predicted
and determined epitopes is probably due to the ability of the
computer model to predict which regions of the molecule are
exposed on the surface of the allergen, making them accessible
to immunoglobulin interactions. There are at least 23 different
IgE recognition sites on the major peanut allergen Ara h 1.
These sites are distributed throughout the protein. The identification
of multiple epitopes on a single allergen is not novel. Allergens
from cow milk [9], codfish [10], hazel [11], soy [12], and
shrimp [13] have all been shown to contain multiple IgE-binding
epitopes. The observation that most of these proteins have
multiple IgE-binding sites probably reflects the polyclonal nature
of the immune response to them and may be a necessary
step in establishing a protein as an allergen.

The elucidation of the major IgE-binding epitopes on Ara
h 1 may also enable us to better understand the immunopathogenic
mechanisms involved in peanut hypersensitivity. Recent
evidence suggests that there is a preferential variable heavy-chain
usage in IgE synthesis and a direct switching from IgM
production to IgE synthesis [14]. This would suggest that epitopes
responsible for antigen-specific IgE antibody production
may differ from those promoting antigen-specific IgG antibodies.
Immunotherapeutic approaches utilizing peptides representing
IgG epitopes may be able to shift the balance of antigen-specific
antibody production from IgE to IgG. We are currently
identifying which of the IgE-binding epitopes also bind IgG to
determine if this would be a feasible strategy for patients with
peanut hypersensitivity.

Fig. 7. All of the immunodominant Ara h 1 peptides can be mutated to non-IgE binding epitopes by single amino acid changes. All of the
immunodominant Ara h 1 epitopes (peptides 1, 3, 4, and 17) were synthesized with an alanine or glycine residue substituted for one of the amino
acids and probed with a pool of serum IgE from 15 patients with documented peanut hypersensitivity. The relative intensity of IgE binding to each
peptide was then determined by densitometry and expressed as a percentage of IgE binding to the non-mutated peptide (WT). The letters across
the bottom of each panel indicate the one-letter amino acid code for the residue normally at that position and the amino acid that was substituted
for this residue. The numbers indicate the position of each residue in the Ara h 1 protein.

Four of the Ara h 1 peptides appear to be immunodominant
IgE-binding epitopes in that they are recognized by more than 
80% of patient sera tested. Interestingly, epitope 17, which is 
located in the C-terminal end of the protein (amino acids 498- 
507), is in a region that shares significant sequence similarity 
with vicilins from other legumes [15]. The amino acids impor- 
tant for IgE binding also appear to be conserved in this region 
and may explain the possible cross-reacting antibodies to other 
legumes that can be found in sera of patients with a positive 
double-blind placebo-controlled food challenge to peanuts. Epi- 
topes 1, 3, and 4 located in the N-terminal portion (amino acids 
25-34, 65-74, and 89-98) of the protein, appear to be unique
to this peanut vicilin and do not share any significant sequence 
similarity with vicilins from other legumes [15]. In addition, the 
amino acids important to IgE binding in this region are not con- 
served. These findings may enable us to develop more sensitive 
and specific diagnostic tools and lead to the design of novel 
therapeutic agents to modify the allergic response to peanuts. 

Our data show that it may be possible to mutate the Ara h 1 
allergen to a protein that no longer binds IgE. This raises the 
possibility that an altered Ara h 1 gene could be used to replace 
its allergenic homologue in the peanut genome. In this manner, 
a hypoallergenic peanut could be developed that may blunt allergic
reactions in sensitive individuals who inadvertently ingest
this food. Furthermore, a hypoallergenic peanut may prevent the
development of peanut hypersensitivity in individuals genetically
predisposed to this food allergy. Because the Ara h 1 gene
product is such an abundant and integral seed storage protein
[15], a prudent approach would require that the altered vicilin
retain as much of its native function and properties as possible.
Given the potential severity of peanut allergic reactions and the
widespread use of peanuts in consumer foods, this possibility is
actively being pursued in our laboratories.

This work was supported in part by grants from the National Institutes
of Health (AT33596) and the Clarissa Sosin Research Foundation.

Evidence for an early appearance of modern post-switch
isotypes in mammalian evolution; cloning of
IgE, IgG and IgA from the marsupial Monodelphis 
domestica 

Maria Aveskogh and Lars Hellman 

Department of Medical Biochemistry and Microbiology, University of Uppsala, Uppsala, Sweden 

In birds, reptiles and amphibians the IgY isotype exhibits the functional characteristics of
both IgG and IgE. Hence, the gene for IgY most likely duplicated some time during early
mammalian evolution and formed the ancestor of present day IgG and IgE. To address the
question of when IgY duplicated and formed two functionally distinct isotypes, and to study
when IgG and IgA lost their second constant domains, we have examined the Ig expression
in a non-placental mammal, the marsupial Monodelphis domestica (grey short-tailed opossum).
Screening of an opossum spleen cDNA library revealed the presence of all three isotypes
in marsupials. cDNA clones encoding the entire constant regions of opossum IgE (ε
chain), IgG (γ chain) and IgA (α chain) were isolated, and their nucleotide sequences were
determined. A comparative analysis of the amino acid sequences for IgY, IgA, IgE and IgG
from various animal species showed that opossum IgE, IgG and IgA on the phylogenetic tree
form branches clearly separated from their eutherian counterparts. However, they still conform
to the general structure found in eutherian IgE, IgG and IgA. Our findings indicate that
all the major evolutionary changes in the Ig isotype repertoire, and in basic Ig structure that
have occurred since the evolutionary separation of mammals from the early reptile lineages,
occurred prior to the evolutionary separation of marsupials and placental mammals.

Key words: Marsupial / IgG / IgE / IgA / IgY / Evolution

1 Introduction

Neither IgG nor IgE have been identified in reptiles,
amphibians or birds and are thus probably unique for
mammals [1]. In birds and reptiles antibodies of the IgY
isotype exhibit the functional characteristics of both IgG
and IgE. Hence, it is probable that IgY duplicated sometime
during the early mammalian evolution and formed
two separate Ig classes [1]. IgY of birds and reptiles has,
like IgE and IgM, four constant domains, while IgG in placental
mammals has only three. A sequence comparison
between different domains of IgY and IgG shows that it is
most likely the second domain of IgY which has been
deleted [1]. A short hinge exon is probably the only
remaining part of the original exon encoding the second
constant domain [1]. Interestingly, IgA in birds also has
four constant domains, while the α chain of IgA in placental
mammals has only three. Hence, two separate
deletion events seem to have occurred during early
mammalian evolution. The amphibian Xenopus laevis
expresses an isotype with similar tissue distribution as
mammalian IgA [2]. However, this isotype (IgX) does not
seem to be a direct homolog of IgA in mammals and
birds. IgX shows, for example, a higher sequence identity
with IgM than with mammalian and bird IgA [2]. Interestingly,
IgY in the amphibian axolotl (Ambystoma mexicanum)
has the capacity to be transferred across epithelial
surfaces, in that respect resembling mammalian
IgA [3]. Consequently, IgY of certain amphibians, which
lack IgX, seems to display the effector functions of all
three mammalian Ig isotypes (IgG, IgA and IgE).

Marsupials are thought to have separated evolutionarily 
from the eutherian mammals 140 to 200 million years 
ago, while the major radiation of the placental mammals 
probably occurred 140 to 100 million years ago. To fur- 
ther understand the evolution of Ig classes, studies of 
mammals distantly related to placentals, such as marsu- 
pials, may provide useful information. We here present 
the cloning of the ε chain of IgE, the γ chain of IgG and
the α chain of IgA from the marsupial Monodelphis
domestica, the grey short-tailed opossum.

Tyr Gly Tyr Arg Leu Asp Asn Trp Gly Lys Gly Thr Thr Val Thr Val Ser Ser Glu Pro Leu Thr Ala Pro Ser Val Phe Pro Leu Met
TACGGATATAGGCTTGATAAC TGGGGCAAGGGGACCACGGTGACTGTGTCCTCA GAACCCCTCACAGCTCCATCTGTCTTCCCTTTGATG 90

Ser Cys Cys Asp Ser Asn Ser Gly Gly Lys Ala Phe Leu Gly Cys Leu Ala Thr Gly Tyr Phe Pro Glu Ser Val Thr Ile Asn Trp Asn
TCCTGCTGCGATTCCAATTCTGGAGGCAAAGCTTTCCTGGGATGCCTTGCAACTGGCTATTTCCCTGAGTCAGTGACTATAAACTGGAAC

Ser Glu Val Leu Asp Glu Ile Ile Thr Asn Phe Pro Ala Thr Tyr Asp Pro Thr Ser Gly Arg Tyr Thr Thr Thr Ser Gln Leu Thr Val
AGTGAAGTCCTGGATGAGATAATCACAAACTTTCCAGCTACTTATGATCCCACATCTGGACGCTATACCACCACCAGCCAACTTACCGTC 270

Ser Asp Leu Ser Asp Gln Glu Phe Thr Cys Ser Val Asp His Leu Pro Thr Ser Thr Lys Ile Lys Lys Thr Leu Ser Leu Pro Glu Cys
TCAGATTTGAGTGACCAGGAATTCACCTGCAGCGTGGACCACTTACCTACCAGCACCAAAATCAAAAAAACTTTATCTCTCCCAGAATGT 360

Gly Pro Val Thr Ile Ile Pro Pro Thr Val Lys Leu Phe His Ser Ser Cys Asp Pro Arg Gly Asp Ala His Ser Thr Ile Gln Leu Leu
GGCCCTGTGACAATCATCCCACCTACAGTGAAGCTCTTCCACTCATCCTGTGACCCCCGAGGGGATGCTCATTCCACCATCCAGCTGCTC 450

Cys Leu Val Ser Gly Phe Ser Pro Ala Lys Val His Val Thr Trp Leu Val Asp Gly Gln Glu Ala Glu Asn Leu Phe Pro Tyr Thr Thr
TGCCTTGTCTCTGGCTTCTCCCCAGCCAAGGTCCATGTGACCTGGCTGGTAGATGGACAGGAGGCTGAAAATCTCTTTCCCTATACAACC

Arg Pro Lys Arg Glu Gly Gly Gln Thr Phe Ser Leu Gln Ser Glu Val Asn Ile Thr Gln Gly Gln Trp Met Ser Ser Asn Thr Tyr Thr
AGACCTAAGAGGGAAGGGGGACAGACTTTTTCTCTACAAAGTGAAGTCAACATCACACAGGGCCAGTGGATGTCATCAAACACCTACACC

Cys His Val Lys His Asn Gly Ser Ile Phe Glu Asp Ser Ala Gln Lys Cys Ser Asp Thr Asp Pro Arg Gly Ile Ser Ala Tyr Ile Leu
TGCCATGTCAAGCACAATGGCAGCATCTTTGAAGACAGTGCCCAAAAGTGTTCAGACACTGATCCACGAGGCATCAGTGCCTACATCCTC

Pro Pro Thr Pro Gln Asp Leu Phe Val Lys Lys Val Pro Thr Ile Gly Cys Leu Ile Val Asp Leu Ala Ser Ala Glu Asn Val Lys Val
CCACCTACTCCTCAAGACCTCTTTGTCAAGAAAGTTCCCACAATTGGATGCCTGATAGTGGACCTGGCCAGTGCAGAGAATGTGAAGGTT

Thr Trp Ser Arg Glu Ser Gly Gly Pro Val Asn Pro Ser Ser Leu Val Val Lys Glu Gln Tyr Asn Gly Thr Phe Thr Val Thr Ser His
ACTTGGTCCCGAGAAAGTGGAGGTCCTGTCAATCCAAGCTCACTAGTGGTGAAGGAGCAGTATAATGGCACTTTCACAGTCACCTCACAC

Leu Pro Val Asn Thr Asp Asp Trp Ile Glu Gly Asp Thr Tyr Thr Cys Arg Leu Glu Ser Pro Asp Met Pro Val Pro Leu Ile Arg Thr
CTGCCTGTCAACACAGATGATTGGATAGAGGGTGATACCTATACCTGCCGTCTGGAAAGCCCTGACATGCCCGTCCCTCTCATCAGGACC

Ile Ser Lys Ala Pro Gly Lys Arg Leu Ala Pro Glu Val Tyr Met Leu Pro Pro Ser Pro Glu Glu Thr Gly Thr Thr Arg Thr Val Thr
ATCTCCAAGGCTCCAGGCAAACGCTTAGCCCCCGAGGTATATATGCTCCCTCCATCTCCAGAGGAAACAGGAACCACTCGCACTGTAACC

Cys Leu Ile Arg Gly Phe Tyr Pro Ser Glu Ile Ser Val Gln Trp Leu Phe Asn Asn Glu Glu Asp His Thr Gly His His Thr Thr Thr
TGCCTAATTCGGGGTTTCTACCCTTCTGAAATATCTGTCCAATGGCTGTTTAATAACGAAGAGGACCACACTGGACACCATACTACCACC

Arg Pro Gln Lys Asp His Gly Thr Asp Pro Ser Phe Phe Leu Tyr Ser Arg Met Leu Val Asn Lys Ser Ile Trp Glu Lys Gly Asn Leu
CGTCCCCAAAAGGACCACGGAACGGATCCTTCCTTCTTCCTCTACAGCCGAATGCTTGTCAACAAGTCTATTTGGGAAAAAGGCAATCTC

Val Thr Cys Arg Val Val His Glu Ala Leu Pro Gly Ser Arg Thr Leu Glu Lys Ser Leu His Tyr Ser Ala Gly Asn •
CTCACCTGCCGTGTGGTGCATGAAGCCCTACCTGGCTCCCGCACCCTGGAAAAAAGCCTGCATTACTCAGCTGGTAACTGATTACCCTGA

CCCAGACCTTTTACCCCTTCTAAACTCCTCATGAGGAGCCATTCTGGTCCCCTAGAGGAGGGTACATCTGTCTGGTGTCTAAATACCCCA 1440
GCAGTTGACTTAATAAATGTATATGTCCATGTCAAAAAAAAA 1482

Figure 1. Structure of the opossum ε chain cDNA clone. A schematic map (in scale) of the opossum ε chain cDNA clone is
shown at the top of the figure. The striped boxes indicate the four constant regions (CH1-CH4), the joining segment is represented
in black and the variable region, together with the D segment, in dark grey. The approximate positions of the intra-chain
disulfide bridges have been marked by brackets. An H marks the cysteines involved in putative inter-chain disulfide bridges
between the two heavy chain polypeptides, and the L marks the cysteine involved in the putative inter-chain disulfide bridge
between the heavy and the light chain polypeptides. The entire sequence of the insert from the original lambda clone, excluding
the linker sequences which have been deleted from the 5' and 3' ends, respectively, is depicted below the schematic map. The
tentative positions of the exon boundaries, based on the exon-intron structures of mouse, rat, human, and dog ε chain genes, are
indicated by small vertical lines. The position of the J segment is highlighted by a thick black line below the corresponding
sequence. The cysteines involved in intra-chain disulfide bridges are marked by asterisks, and the different disulfide bridges are
marked by thick brackets to the right of the sequence. The amino acid sequence is depicted above the corresponding coding
sequence, and potential N-glycosylation sites are indicated by open circles. The nt sequence of opossum IgE has been submitted
to the EMBL/GenBank nucleotide sequence databases and has been assigned the accession number AF035194.

Eur. J. Immunol. 1998. 28: 2738-2750



Figure 2. Amino acid alignment of all presently available ε chain sequences. Amino acid positions that are conserved in at least
three of the aligned sequences are represented in black. The cysteine residues involved in inter-chain and intra-chain disulfide
bridges are marked by asterisks. The cysteine residues involved in the formation of inter-chain disulfide bridges to the light or
heavy chains are indicated by L or H, respectively. The positions of the exon-intron boundaries (in the cases when the gene structure
has been determined) are indicated by a pointed black dot above the corresponding sequence.



2 Results 

2.1 Cloning and nucleotide sequence analysis of
M. domestica IgE, IgG and IgA ε, γ and α
chains

A PCR-based cloning strategy was adopted for the
isolation of cDNA clones encoding the ε and γ chains of
opossum IgE and IgG. Initially, a comparison of the
nucleotide sequences for the coding regions of the ε and
γ chains of a panel of different eutherian IgG and IgE was
made. Based on the comparison, two regions with a high
degree of sequence conservation for each of these two
isotypes were selected for synthesis of degenerate
oligonucleotide primers. PCR amplifications, using the
degenerate primers, were performed using purified
lambda DNA from an amplified cDNA library derived
from spleen of the grey short-tailed opossum as template.
This resulted in isolation of partial cDNA clones for
opossum IgE and IgG. These partial cDNA clones were
used as probes to screen the same cDNA library that had
been used during the initial PCR amplification. Screening
of about 150 000 plaques from this cDNA library with the
ε chain probe resulted in approximately 40 positive signals,
and screening of approximately 50 000 plaques
with the γ chain probe resulted in more than 100 positive
signals. Four of the ε chain clones, and eight of the γ
chain clones were selected for further analysis. Nucleotide
sequence analysis from both ends of the inserts of
each clone showed that one of the clones contained the
entire coding region for the constant region of the opossum
IgE ε chain (Fig. 1) and one clone the entire coding
region of an IgG γ chain, including the signal sequence, a
variable region, the DJ segments and the entire constant
region (Fig. 3).

The opossum cDNA library was then screened with the
variable region from the opossum IgG (γ chain) as probe.
Following screening of the V region-positive clones with
probes for both IgG (γ chain) and IgE (ε chain), 12 IgG-
and IgE-negative clones were identified. Using a primer
directed against the region spanning the J segment, the
nucleotide sequences of the N-terminal region of the
constant domains of these 12 clones were determined.
From this screening several clones for opossum IgA
were identified. Nucleotide sequence analysis from both
ends of the inserts of each clone showed that one of the
clones contained the entire coding region of an IgA α
chain, including the signal sequence, a variable region,
the DJ segments and the entire constant region (Fig. 5).
The complete nucleotide sequence was determined for
the IgE, IgG and IgA ε, γ and α chain clones (Figs. 1, 3
and 5). The corresponding amino acid sequence of the
constant region for opossum IgE, IgG and IgA ε, γ and α
chains was compared with the previously published
amino acid sequences of mammalian IgE, IgG and IgA
sequences (Figs. 2, 4 and 6). The result shows that the
overall structure of the opossum IgE conforms to the
structure of the previously characterized eutherian IgE
(Fig. 2). The various cysteine bridges have been conserved,
including the second disulfide bridge in the CH1
domain which has been lost in rodent IgE (Figs. 1 and 2).
The structural analysis of opossum IgG and IgA shows
that the overall structure of these isotypes also conforms
to the structure of the previously characterized eutherian
IgG, including the presence of three constant domains
(Figs. 3-6).

Nucleotide sequence analysis of five independent ε
chain clones and eight independent γ chain clones gave
identical sequences within the regions analyzed,
indicating that only one IgE and one IgG isotype is
expressed in the opossum (data not shown). Nucleotide
sequence analysis of two independent α chain clones
showed almost identical sequences. The few nucleotide
differences between the clones most likely originate from
allotypic variation and not from separate isotypes.
However, based on only two IgA sequences, we cannot at
present draw any conclusions on whether one or several
IgA isotypes are expressed in the opossum (data not
shown).

A comparison was also made between opossum IgG,
IgE and IgA and all presently available amino acid
sequences for the constant regions of mammalian IgE,
IgG and IgA as well as a few IgA and IgY sequences from
birds and an amphibian (A. mexicanum). The resulting
phylogenetic tree shows that the opossum IgE, IgG and
IgA form separate branches, outside of all eutherian IgE,
IgG and IgA so far isolated (Fig. 7).

2.2 Heavy chain V region diversity in the 
opossum 

Recent data indicate that only two V gene families (VH1 
and VH2) exist in the opossum and that both of these 
gene families belong to group III of mammalian V
Figure 3. Structure of the opossum γ chain cDNA clone. For details concerning the abbreviations see the legend to Fig. 1. The
tentative positions of the exon boundaries, based on the exon-intron structures of mouse γ chain genes, are indicated by small
vertical lines. Two regions with repeated sequences were identified in the 3' non-coding region of the mRNA. These two repeats,
a CA repeat and a CAT repeat, have been marked by an open and a striped box, respectively. The nucleotide sequence of
opossum IgG has been submitted to the EMBL/GenBank nucleotide sequence databases and has been assigned the accession
number AF035195.
gene families [4]. It is not yet known if this apparent
limitation in V gene germ-line diversity is compensated for
by a particularly large variation in the
complementarity-determining regions (CDR) of these
germ-line genes, by somatic mutations or by N region
diversity in the CDR3.



Figure 4. Amino acid alignment of a selected panel of γ chain sequences. For details concerning the abbreviations see the
legend to Fig. 2.

The variable regions of the opossum γ chain and α chain
clones were compared with a panel of recently published
heavy chain V region sequences [4]. The γ chain V region
was found to be a new member of the opossum VH2 gene
family [4]. Twelve nucleotide differences were found
within a 353-nucleotide region when comparing our
sequence with a germ-line VH2 sequence [4]. The variable
region of the opossum α chain clone was found to be a
new member of the opossum VH1 gene family [4]. Based
on these two complete V regions and preliminary data
from several additional V region clones (work in
progress), we see indications of a relatively large degree
of variability in the CDR3 region, which may compensate
for the low variability in the germ-line V regions.
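
To put the twelve differences in a 353-nucleotide region into
proportion, the uncorrected percentage difference can be computed
directly. This is a minimal arithmetic sketch; the function name is
illustrative, and no correction for multiple substitutions is
applied.

```python
def percent_difference(differences: int, aligned_length: int) -> float:
    """Uncorrected percentage of differing positions in an aligned region."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * differences / aligned_length

# Values from the text: 12 differences within a 353-nucleotide region.
vh2_divergence = percent_difference(12, 353)
print(f"{vh2_divergence:.1f} % nucleotide difference from the germ-line VH2 sequence")
```

The clone thus differs from the germ-line VH2 sequence at a few
percent of positions over this region.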



3 Discussion 

Previous investigations of mammalian Ig evolution have
mainly focused on placental mammals. In the present
report, we demonstrate for the first time the presence of
IgG, IgE and IgA in a mammal distantly related to the
placental mammals, the marsupial M. domestica. A
comparison of the presently available mammalian ε, γ and
α chain sequences indicates that only minimal changes in
the overall structure have occurred in the 140-200
million years since the divergence of marsupials and
placental mammals [5]. Except for several small
Figure 5. Structure of the opossum α chain cDNA clone. For details concerning the abbreviations see the legend to Fig. 1.



insertions and deletions (one to four full codons in
length), only very few changes other than single point
mutations have occurred in IgG, IgE and IgA. However,
the number of point mutations has been relatively high,
resulting in substantial divergence among the various
primary nucleotide and amino acid sequences. Both the
γ chain of opossum IgG and the α chain of opossum IgA
were found to contain only three constant domains
(Figs. 3 and 5). This could be explained either by a
deletion prior to the evolutionary separation of marsupials
and placental mammals or by two independent deletions,
one in each of these two major mammalian lineages. The
very high overall structural similarity of marsupial and
placental IgG and IgA does, however, strongly favor the
first alternative. Thus, the most likely explanation is that
the second domain of IgG and IgA was
Figure 6. Amino acid alignment of all presently available α chain sequences. For details concerning the abbreviations see the
legend to Fig. 2. Two cysteines which are found in all α chain sequences from placental mammals but not in the opossum α chain
sequence are marked with asterisks within brackets. The cysteine residues involved in the formation of inter-chain disulfide
bridges to the light or heavy chains are indicated by L or H, respectively. Two of these are also marked with a question mark,
indicating the uncertainty of these cysteine residues. Although less likely, they may instead be involved in an additional
intra-chain disulfide bridge (with each other).

Figure 7. A comparative analysis of the amino acid sequence of the four constant domains from the presently available
mammalian IgE, IgG and IgA sequences, two IgA and two IgY sequences from birds and the IgY sequence from the axolotl, presented
in a phylogenetic tree. The bootstrap tree is based on 1000 independent comparisons and the bootstrap values are added at
each branch point. The sequence distance is represented by the length of the branches according to the scale at the bottom of
the figure, which corresponds to a sequence divergence of 4 %.



deleted prior to the evolutionary separation of
marsupials and placental mammals. The opossum IgA α
chain differs in one major aspect from its eutherian
counterparts: it lacks two cysteine residues in the CH2
domain, and opossum IgA may thereby lack the second
intra-chain disulfide bridge in the CH2 domain found in
eutherian IgA (Fig. 6) [6-8]. The significance of this
difference for thermal stability or other characteristics of
the protein is, however, not yet known. The most striking
observation from the comparison of the various ε chain
sequences presented in Fig. 2 is that rodent IgE lacks the
second intra-chain disulfide bridge in the CH1 domain.
This intra-chain disulfide bond is present in all of the
other sequences, including the marsupials (positions 15
and 105). Furthermore, both mouse and rat IgE have
substantially longer C-terminal ends than the other
sequences (six and ten amino acids, respectively). These
changes have therefore most likely occurred within a
rodent ancestor after the major radiation of the placental
mammals.

The constant region of opossum IgE was found to contain
four potential glycosylation sites (see Figs. 1 and 2).
Only one of these sites, the second site in the CH3
domain (position 282 in Fig. 2), is conserved among the
various mammalian ε chain sequences, indicating that
most of the glycosylation sites in mammalian IgE may be
expendable. However, the site corresponding to position
282 in Fig. 2 is conserved in other Ig classes as well,
suggesting that glycosylation at this position is of
importance for the protein. This position is also conserved
within the corresponding sequence of the CH2 domain of
opossum IgG (Fig. 3).
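
Potential glycosylation sites of the kind discussed above are usually
located by scanning for a short sequence motif. The sketch below
assumes that N-linked sites are meant and uses the common Asn-X-Ser/Thr
rule (X being any residue except Pro); the text does not state which
rule was applied, and the short peptide is purely illustrative.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr. The lookahead
# lets overlapping motifs (e.g. "NNTS") be reported separately.
SEQUON = re.compile(r"N(?=[^P][ST])")

def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues in N-X-S/T motifs (X != Pro)."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]

# Illustrative peptide in one-letter code, not a verified sequence.
toy_peptide = "KKLNNGTFQVV"
print(find_sequons(toy_peptide))
```

A motif match only marks a potential site; whether the residue is
actually glycosylated has to be established experimentally.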

Many separate gene duplications have presumably
occurred during the evolution of the five different Ig
classes and subclasses that we see in different mammals
of today. From the phylogenetic tree presented in Fig. 7,
we can see that most of the gene duplications leading to
expansion of the number of isotypes within a certain Ig
class have occurred relatively recently and as
independent gene duplications within each separate
mammalian lineage. The fact that many separate gene
duplications have resulted in very similar end results is,
however, striking. For example, rodents as well as man
and pig have four to five functional IgG genes. There are,
however, exceptions to this pattern. The rabbit, and most
likely also the opossum, has only one IgG gene, and the
rabbit has at least 13 different genes for IgA.

In contrast to IgG and IgA, only one IgE isotype has been
found in all mammals studied, including the marsupials,
indicating that there is a selective advantage in having
only one gene for IgE and in having this gene under very
stringent control. Because of the potentially harmful
effects of IgE, the concentration of plasma IgE has most
likely been kept low by strong control by T cells. During
evolution this has probably been easier to maintain by
having only one functional IgE gene. One striking example
which supports this conclusion is found among the
human Ig genes. A relatively recent gene duplication has
led to an increase in IgG genes from two to four and an
increase in the number of IgA and IgE genes from one to
two. Most likely all four of these genes were contained
within one single duplicated genomic fragment. Following
this duplication all four IgG genes (IgG1 to 4) and
both IgA genes (IgA1 and IgA2) have been kept
functionally intact. However, the second gene for IgE has
been functionally inactivated by a large deletion including
parts of the switch region and a region covering the
exons for constant domains one and two [9].

In summary, the identification of both IgG and IgE in the
opossum shows that the duplication of IgY occurred
more than 140-200 million years ago. The presence of
only three constant domains in both opossum IgG and
IgA also shows that all the major changes in Ig structure
that have occurred since the evolutionary separation of
mammals from other early reptile lineages had already
occurred before the evolutionary separation of marsupials
and placental mammals.

4 Materials and methods 
4.1 Design of PCR primers

Based on previously published ε chain sequences from
mouse, rat, human, orangutan, chimpanzee, dog, horse, pig
and sheep [10-19], two regions of particularly high sequence
homology were selected as target regions for synthesis of
degenerate PCR primers, one in the N-terminal part of the
C3 domain and one in the C-terminal region of the C4
domain. The primer sequences are:
oligo 212-C3-IgE: GACGAATTCCCCAMRMYYACCTGYCTGGTKKTGGACGTGG;
oligo 214-C4B-IgE: GATGGATCCACCTGYAGGGGFlCTRWAGRYGAAGAAG.

Based on previously published γ chain sequences from
human IgG1, mouse IgG1, rat IgG2a, sheep IgG2, rabbit
IgG, hamster IgG, and pig IgG1 [20-27], two regions of
particularly high sequence homology were selected as target
regions for synthesis of degenerate PCR primers, one in the
C-terminal part of the C1 domain and one in the C-terminal
region of the C3 domain. The primer sequences are:
oligo 226-C1-IgG: GAGGAATTCACMMRSCCASCARCRCCAARGTGGACAAG;
oligo 227-C3-IgG: GGAGGATCCTRTRGTGGTTGTGCARRSCCTCRTGYADCAC.
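
The degenerate primers above are written with IUPAC ambiguity codes
(for example M = A or C, R = A or G, Y = C or T, K = G or T). The
following sketch shows how the number of distinct oligonucleotides in
such a mixture can be counted. The helper names are illustrative, and
the primer string is oligo 212-C3-IgE as transcribed here, so any
transcription error would carry over into the count.

```python
from itertools import product

# IUPAC nucleotide codes: each symbol maps to the bases it can stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for symbol in primer.upper():
        count *= len(IUPAC[symbol])  # KeyError on a non-IUPAC symbol
    return count

def expand(primer: str) -> list[str]:
    """List every concrete sequence; only sensible for low degeneracy."""
    choices = (IUPAC[symbol] for symbol in primer.upper())
    return ["".join(bases) for bases in product(*choices)]

# Oligo 212-C3-IgE as transcribed in the text above.
OLIGO_212_C3_IGE = "GACGAATTCCCCAMRMYYACCTGYCTGGTKKTGGACGTGG"

print(len(OLIGO_212_C3_IGE), "nt,", degeneracy(OLIGO_212_C3_IGE), "variants")
print(expand("AY"))
```

Counting is done by multiplication rather than by enumeration, since
the expanded list grows exponentially with the number of ambiguous
positions.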



4.2 PCR amplification

Phage particles from approximately 300 µl of an amplified
M. domestica total spleen cDNA library in λ-ZAP
(Stratagene, La Jolla, CA) (3.7 x 10^ PFU/ml) were, after
addition of 700 µl SM [100 mM NaCl, 10 mM MgSO4, 50 mM
Tris pH 7.5 and 0.1 % (w/v) gelatin], precipitated by
addition of 0.1 g PEG 6000 and 60 mg NaCl. After the PEG
and NaCl had dissolved, the sample was kept on ice for
1 h. The phages were then pelleted by centrifugation for
10 min at 13 000 rpm in an Eppendorf centrifuge. The pellet
was dissolved in 300 µl SM and then extracted once with
phenol:chloroform (50:50) in the presence of approximately
6 mM EDTA, 0.06 M NaCl and 0.3 % SDS, followed by one
extraction with chloroform. The DNA was precipitated with
ethanol in the presence of 0.3 M sodium acetate and
dissolved in 100 µl double-distilled water. A 5-µl aliquot
of the cDNA preparation was used as template in a standard
PCR. The PCR was performed in 45 cycles consisting of 40 s
at 94 °C, 60 s at 60 °C and 90 s at 72 °C in an MJ Research
minicycler.
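
The quantities in this protocol translate into approximate working
concentrations and a minimum cycling time. The sketch below is
arithmetic only and rests on stated assumptions: the precipitation
volume is taken as 1.0 ml (300 µl phage stock plus 700 µl SM), the
volume change from the added solids is ignored, and ramp times
between temperature steps are not counted.

```python
NACL_G_PER_MOL = 58.44  # molar mass of sodium chloride

def percent_w_v(mass_g: float, volume_ml: float) -> float:
    """Concentration in % (w/v), i.e. grams per 100 ml."""
    return 100.0 * mass_g / volume_ml

def molarity(mass_g: float, g_per_mol: float, volume_ml: float) -> float:
    """Concentration in mol/l."""
    return mass_g / g_per_mol / (volume_ml / 1000.0)

def cycling_seconds(cycles: int, step_seconds: tuple[int, ...]) -> int:
    """Total hold time of a cycling program, excluding ramping."""
    return cycles * sum(step_seconds)

# Assumed precipitation volume: 300 µl phage stock + 700 µl SM.
volume_ml = (300 + 700) / 1000

peg_percent = percent_w_v(0.1, volume_ml)
nacl_added_molar = molarity(0.060, NACL_G_PER_MOL, volume_ml)
hold_seconds = cycling_seconds(45, (40, 60, 90))

print(f"PEG 6000: {peg_percent:.0f} % (w/v)")
print(f"NaCl added: {nacl_added_molar:.2f} M (on top of the NaCl already in SM)")
print(f"PCR hold time: {hold_seconds} s ({hold_seconds / 60:.1f} min)")
```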

4.3 Isolation of full-length cDNA for opossum IgG and
IgA (γ and α chains) and a cDNA containing the
entire constant region of opossum IgE ε chain

A cDNA library of approximately 2 x 10® recombinants was 
obtained from Stratagene (La Jolla, CA). Approximately
200 000 plaques from this amplified cDNA library were
spread on a monolayer of the E. coli XL1-Blue MRF' strain
at a density of approximately 25 000 clones per 138-mm
plate. The plaques were transferred to Hybond N+ filters
(Amersham Int., Amersham, GB). The filters were screened
with the fragments derived from the cloned PCR products for
opossum IgG γ chain and IgE ε chain, and subsequently
washed at high stringency (0.1 x SSC, 0.1 % SDS). Positive
plaques were purified, and the pBluescript phagemid of
each clone was excised from the lambda ZAP-II vector by
helper phage co-infection according to the protocol of the
supplier of the lambda ZAP-II library (Stratagene). The
different cDNA clones and the cloned PCR products were
sequenced from both ends by the use of Sp6 and T7
primers. Based on the obtained sequence, new sequencing
primers were designed and used to complete the sequence of
the entire insert. All nucleotide sequences presented in this
study were established by sequencing both strands of the
DNA insert.



4.4 Amino acid sequence alignment 

The amino acid sequences of the constant regions from a
panel of IgE, IgG, IgA, and IgY sequences were compared
using the CLUSTAL W program [28]. A bootstrap tree based
on 1000 independent calculations was constructed with this
program (Fig. 7).
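
The idea behind a bootstrap analysis of an alignment can be sketched
as follows: alignment columns are resampled with replacement, and a
distance is recomputed for each replicate. This is a simplified
illustration and not the CLUSTAL W procedure itself; it stops at
pairwise distances and does not build a tree, and the toy alignment,
the uncorrected p-distance and all function names are assumptions
made for the example.

```python
import random

def p_distance(a: str, b: str) -> float:
    """Fraction of differing residues over columns where neither has a gap."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable columns")
    return sum(x != y for x, y in pairs) / len(pairs)

def bootstrap_columns(alignment: dict[str, str], rng: random.Random) -> dict[str, str]:
    """Resample alignment columns with replacement (one bootstrap replicate)."""
    length = len(next(iter(alignment.values())))
    columns = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(seq[i] for i in columns) for name, seq in alignment.items()}

def bootstrap_distances(alignment: dict[str, str], a: str, b: str,
                        replicates: int = 1000, seed: int = 0) -> list[float]:
    """p-distance between sequences a and b in each bootstrap replicate."""
    rng = random.Random(seed)
    distances = []
    for _ in range(replicates):
        replicate = bootstrap_columns(alignment, rng)
        distances.append(p_distance(replicate[a], replicate[b]))
    return distances

# Toy, gap-free alignment; the sequences are invented for illustration.
toy_alignment = {
    "seq_a": "ACDEFGHIKL",
    "seq_b": "ACDEFGHIKV",
    "seq_c": "ACNEYGHLKV",
}
replicate_distances = bootstrap_distances(toy_alignment, "seq_a", "seq_c")
print(p_distance(toy_alignment["seq_a"], toy_alignment["seq_c"]))
print(min(replicate_distances), max(replicate_distances))
```

In a full analysis a tree would be built from each replicate, and the
bootstrap value at a branch point would be the number of replicate
trees that contain that grouping.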

The original references for the amino acid sequences are as
follows: human IgE ε chain [13, 14], mouse IgE ε chain [10],
rat IgE ε chain [11, 12], chimpanzee IgE ε chain [15],
orangutan IgE ε chain [15], dog IgE ε chain [17], horse IgE
ε chain [19], pig IgE [18], sheep IgE ε chain [16], chicken
IgA α chain [29], duck IgA α chain (unpublished GenBank
accession number, APU27222), mouse IgA [30], rabbit
IgA1-13 [31], dog IgA [17], pig IgA [32], macaque IgA [33],
gorilla IgA [33], human IgA1 and IgA2 [34], axolotl IgY υ
chain [35], chicken IgY υ chain [36], duck IgY υ chain [37],
human IgG1 [20], human IgG2 [20, 38], human IgG3 [39],
human IgG4 [38, 40], mouse IgG1 [21, 22], mouse IgG2a
[41, 42], mouse IgG2b [43, 44], mouse IgG3 [45], rat IgG2a
[23], rat IgG2b [23], rat IgG2c [46], sheep IgG2 [24], rabbit
IgG [25], hamster IgG [26], and pig IgG1 [27].
To trace the emergence of the modern post-switch immunoglobulin (Ig) isotypes in
vertebrate evolution we have studied Ig expression in mammals distantly related to eutherians.
We here present an analysis of the Ig expression in an egg-laying mammal, a monotreme, 
the duck-billed platypus (Ornithorhynchus anatinus). Fragments of platypus IgG and IgE 
cDNA were obtained by a PCR-based screening using degenerate primers. The fragments 
obtained were used as probes to isolate full-length cDNA clones of three platypus
post-switch isotypes, IgG1, IgG2, and IgE. Comparative amino acid sequence analysis against
IgY, IgE and IgG from various animal species revealed that platypus IgE and IgG form 
branches that are clearly separated from those of their eutherian (placental) counterparts. 
However, the platypus IgE and IgG still conform to the general structure displayed by the 
respective Ig isotypes of eutherian and marsupial mammals. According to our findings, all of 
the major evolutionary changes in the expression array and basic Ig structure that have 
occurred since the evolutionary separation of mammals from the early reptile lineages, 
occurred prior to the separation of monotremes from marsupial and placental mammals. 
Hence, our results indicate that the modern post-switch isotypes appeared very early in the 
mammalian lineage, possibly already 310-330 million years ago. 
1 Introduction

The combinatorial immune system defined by bona fide 
immunoglobulins and T cell receptors seems to have 
emerged at the appearance of the jawed vertebrates 
[1-3]. Due to the absence of its components in inverte- 
brates as well as agnathans the combinatorial immune 
system has been proposed to have appeared in a rela- 
tively short period of evolutionary time, possibly in a sin- 
gle step catalyzed by horizontal transfer of recombina- 
tion genes from prokaryotes into the vertebrate genome. 
This event has been described as a "Big Bang" in molec- 
ular evolution [2, 3]. Since this "Big Bang", the complex- 
ity of the combinatorial immune system has increased 
gradually throughout vertebrate evolution. Bony and
cartilaginous fishes express primarily IgM, whereas
eutherian (placental) mammals express up to five Ig classes
(IgM, IgD, IgG, IgE and IgA). Fish B cells apparently do 
not undergo isotype switching [4]. However, depending 
on the species, an IgD-like Ig, IgNAR, IgNARC, IgW or 
IgR (IgX) may also be expressed, either through differen- 
tial splicing (the IgD-like Ig) or their presence on loci sep- 
arate from that of IgM, reviewed in [4]. 

Amphibians, reptiles and birds all express IgY, the first 
confirmed post-switch isotype to appear in evolution [5, 
6]. Hence, the mechanism of isotype switching seems to 
have emerged with the tetrapods, or possibly in the 
putative ancestors of tetrapods, the lungfish or the coe- 
lacanth (Latimeria) [6]. Early studies in the lungfish have 
identified a second highly abundant serum Ig in addition 
to IgM that shows similarities with the truncated form of 
IgY found in ducks [7-9]. However, further studies in the 
lungfish and the coelacanth are required to establish at 
which evolutionary stage the first post-switch isotype did 
appear. In addition, such analyses may help to resolve
the origin of tetrapods [10]. 

IgY shares effector functions characteristic of both mam- 
malian IgG and IgE. Hence, an IgY-like ancestor probably 
duplicated sometime early in mammalian evolution and 
formed two separate Ig classes, IgG and IgE [5, 6]. 

Like IgE and IgM, IgY of birds and amphibians has four 
constant domains, whereas IgG in eutherian mammals 
and marsupials has only three. Sequence comparisons 
among different domains of IgY and IgG suggest that the 
second domain of the ancestral IgY was deleted in what 
evolved into the IgG of mammals [5]. A short hinge exon 
found in mammalian IgG seems to be the only remaining 
part of the original exon encoding the second constant 
domain in the ancestral IgY [5]. 

Frogs (Xenopus laevis), but not axolotls, express one
additional post-switch isotype called IgX (different from
IgX of fish), which probably arose later than IgY [6].
Xenopus IgX appears to have a distribution and effector
functions similar to those of IgA in mammals and birds;
however, it is probably not the direct homologue of
mammalian IgA [11].

Most earlier studies of immunoglobulin expression in 
mammals have focused on eutherian mammals, where 
all investigated species express IgM, IgG, IgE and IgA. 
Some eutherian mammals, such as humans and mice, 
express IgD as well. Apart from the eutherians, there are 
two other extant mammalian lineages, the marsupials 
(metatheria) and the monotremes (prototheria). 

Marsupials are represented by more than 270 living
species, whereas the monotremes, a group of egg-laying
mammals, are represented by only three, the duck-billed
platypus and two echidna species. The monotremes
have been regarded as "reptile-like" or "primitive"
mammals simply because they lay eggs. However, they do
possess several of the major mammalian features,
including a well-developed fur coat, a single bone in the
lower jaw, three bones in the middle ear and mammary
glands.
Histological studies show that the spleen, thymus and 
gut-associated lymphoid tissues in the platypus are well 
developed and comparable in histological structure to 
those in therian mammals (eutherians and marsupials) 
[12]. However, in sites where lymph nodes would be 
expected in therian mammals, monotremes were found 
to have lymphoid nodules that resemble the jugular bod- 
ies of the amphibians, which indicates that monotremes 
have a somewhat more primitive immune system [12]. 

The evolutionary separation of the marsupials from the 
placental mammals is generally dated back to
approximately 130 to 175 million years ago (Belov et al.,
"echidna IgM", submitted for publication) [13, 14], 
whereas the major radiation of the placental mammals 
probably occurred 60 to 120 million years ago [14]. The 
evolutionary relationship between marsupials and mono- 
tremes has not yet been fully resolved. The Marsupio- 
nata hypothesis supported mainly by mitochondrial data 
and DNA-hybridization studies suggests that marsupials 
and monotremes are sister lineages [15-18]. The oppos- 
ing Theria hypothesis proposes that the separation of the 
monotremes from the marsupial and eutherian mammals 
occurred much earlier in evolution. The Theria hypothe- 
sis initially emerged from morphological data and has 
more recently gained additional support from paleonto- 
logical data [19, 20] and a number of nuclear gene 
sequences (Belov et al., "echidna IgM" and "echidna 
IgA", both submitted for publication) [13, 21-24]. The 
nucleotide substitution rate in mitochondrial DNA is gen- 
erally more rapid and the tendency for repeated muta- 
tions in the same site is higher than what is observed in 
nuclear DNA [25]. Consequently, time estimates based 
on mitochondrial data may be less accurate over large 
evolutionary distances. In fact, Springer and colleagues 
[25] showed that mitochondrial genes have less resolv- 
ing power than certain nuclear exons in resolving deep- 
level mammalian clades. Studies based on IgM (Belov et 
al., "echidna IgM", submitted for publication) and prot- 
amine sequences estimate the time of divergence of the 
monotremes from the other mammalian lineages to 
around 150 to 170 million years ago [13]. This estimate 
fits quite well with dating based on fossil records and 
anatomical features [19, 20, 26]. 

In a previous study of marsupial immune functions, we
found that the American short-tailed gray opossum
expresses four Ig isotypes, IgM, IgG, IgE and IgA, which
all conform to the basic structure of the corresponding
isotypes in placental mammals [27]. The findings in the
opossum revealed that all of the major changes in the Ig 
isotype pattern that emerged after the separation of the 
mammals from the early reptile lineages had occurred 
prior to the separation of marsupials from placental 
mammals. Hence, the duplication of IgY and the deletion 
of the second constant domain in the ancestral IgG and 
IgA probably took place more than 130 million years ago.

To further trace the evolution of modern post-switch iso- 
types and the appearance of the full range of functionally 
separated Ig isotypes found in therian mammals, 
referred to as "a second Big Bang" by Marchalonis and 
colleagues [3], we initiated a study of Ig isotypes in 
monotremes. As a first step, we describe here the cloning
of IgE, IgG1 and IgG2 from the duck-billed platypus,
Ornithorhynchus anatinus.

2 Results

2.1 Cloning and nucleotide sequence analysis of
the ε- and γ-chains of platypus IgE, IgG1 and
IgG2

A PCR-based cloning strategy was adopted to isolate
cDNA clones encoding partial platypus IgE and IgG
heavy chains. Nucleotide sequence comparisons of
the ε- and γ-chain sequences from a panel of different
mammals had been conducted earlier [27]. Based on
these comparisons, two regions with particularly high
sequence conservation within the respective isotypes
were selected as target regions for synthesis of
degenerate oligonucleotide primers. The degenerate primers
were used in PCR amplifications with double-stranded
cDNA from Australian platypus splenocytes as template.
This resulted in the isolation of one partial cDNA clone
each for platypus IgE and IgG. The clone generated with
the ε-chain primers was approximately 470 base pairs
(bp) long and encoded a region stretching from the
central region of CH3 to the central region of CH4 of the
ε-chain. The partial IgG clone was approximately 540 bp
long, stretching from the N-terminal part of the CH2
domain to the C-terminal part of the CH3 domain.

The partial platypus IgE and IgG cDNA clones were used
as probes to screen a platypus spleen cDNA library.
Screening of about 130,000 plaques (16,250 plaques/filter)
resulted in an average of 16 and 115 positive signals/
filter using the ε- and γ-chain probes, respectively. Four
positive clones, two from each screening, were selected
for further analysis. The complete nucleotide sequences
of the ε- and γ-chains were determined from one
full-length clone of the respective isotypes (GenBank
AY055780 and AY055781).
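
The screening figures above can be turned into approximate clone frequencies. The following is a minimal sketch, assuming the counts read 130,000 plaques spread over eight filters, with averages of 16 (ε probe) and 115 (γ probe) positive signals per filter. The helper name `clone_frequency` is introduced here for illustration only and is not part of the original work.

```python
def clone_frequency(signals_per_filter: float, plaques_per_filter: float) -> float:
    """Return the fraction of plaques on one filter that hybridize to a probe."""
    if plaques_per_filter <= 0:
        raise ValueError("plaques_per_filter must be positive")
    return signals_per_filter / plaques_per_filter


# Values taken from the text; the eight-filter layout is stated in the methods.
TOTAL_PLAQUES = 130_000
N_FILTERS = 8
plaques_per_filter = TOTAL_PLAQUES / N_FILTERS

epsilon_freq = clone_frequency(16, plaques_per_filter)   # epsilon-chain probe
gamma_freq = clone_frequency(115, plaques_per_filter)    # gamma-chain probe

print(f"plaques per filter: {plaques_per_filter:.0f}")
print(f"epsilon-positive fraction: {epsilon_freq:.2%}")
print(f"gamma-positive fraction: {gamma_freq:.2%}")
```

Under these assumptions roughly one plaque in a thousand carries an ε-chain insert, and somewhat fewer than one in a hundred carries a γ-chain insert.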

Upon further screening of the library using a full-length
IgG clone as probe, two clones for a novel γ-chain
isotype were isolated and sequenced, together with nine
additional clones of the previously isolated γ-chain
isotype. The initially isolated γ-chain and the novel γ-chain
isotype were designated IgG1 and IgG2, respectively.
The complete nucleotide sequence of one full-length
γ2-chain was determined (GenBank AY055782).

The amino acid sequences of the variable domains of the
full-length ε, γ1 and γ2 clones are shown in Fig. 1 and the
amino acid sequences of the constant domains of IgE and
the two IgG isotypes are shown in Fig. 2 and 3,
respectively.

Following the isolation of full-length clones for IgG1,
IgG2 and IgE, the platypus cDNA library was screened
with a variable region (V-region) probe. Based on
previous screenings using IgG1, IgG2 and IgE
constant-region-specific probes, IgG- and IgE-negative,
V-region-positive clones were isolated. To avoid eliminating
closely related isotypes, the filters were washed at high
stringency following all constant region hybridizations.
Thirteen of the V-region-positive, γ- and ε-chain-negative
clones were subcloned and sequenced using a
primer directed against the region spanning the J
segment. All thirteen clones were clearly identified
as α-chains of two different IgA-like isotypes: three
α1-chains (IgA1) and ten α2-chains (IgA2). Hence, no
additional IgG or IgE isotypes were identified and we failed to
identify any IgM- or IgD-like isotypes. IgM is present in all
mammals investigated so far, including another
monotreme, the echidna (Belov et al., submitted for
publication), and is therefore likely to be present also in the
platypus. However, a similar screening strategy was used to
isolate the various Ig isotypes expressed by the
opossum (Monodelphis domestica) [27, 28], and in this
screening 9 out of 12 IgG- and IgE-negative clones were
identified as IgM, whereas the remaining 3 turned out to be IgA
(unpublished observations). Therefore, our failure to
identify IgM in platypus is an intriguing subject for
ongoing investigation.

The complete primary structure of platypus IgA1 and
IgA2 will be presented elsewhere (manuscript in
preparation).

2.2 Sequence comparisons and structural
analysis of the ε- and γ-chains of the
platypus IgE and IgG

The complete amino acid sequences of the constant
regions of platypus IgE, IgG1 and IgG2 were compared
with amino acid sequences of the corresponding
isotypes of other mammals (Fig. 2 and 3). The overall
structures of the platypus IgE and IgG conform to the
structures of the previously characterized eutherian and
marsupial IgE and IgG.

The amino acid sequence of platypus IgE reveals that
the cysteine residues involved in the various disulfide
bridges that are conserved in most mammalian IgE have
been conserved also in platypus IgE. Furthermore, both
of the putative inter-chain disulfide bridges present in the
CH2 domain of IgE in all species analyzed so far are
conserved also in platypus IgE (Fig. 1 and 2). Five potential
N-linked glycosylation sites were identified in the
constant region of platypus IgE: two in the CH2 domain, one in
the CH3 domain and two in the CH4 domain (Fig. 2). The
site in the CH3 domain is conserved among all ε-chain
sequences isolated so far, suggesting that glycosylation



2148 M. Vernersson et al.



Eur. J. Immunol. 2002. 32: 2145-2155 



[Fig. 1 graphic: schematic maps of the platypus IgE (S, VH, CH1, CH2, CH3, CH4), IgG1 and IgG2 (S, VH, CH1, hinge, CH2, CH3) heavy-chain clones with the variable-domain amino acid sequences.]



Fig. 1. Schematic maps (in scale) of the platypus ε, γ1 and γ2 heavy-chain clones and the amino acid sequence of the variable
domains encoded by the V, D and J gene segments of the respective full-length clones. The signal sequences (S) are represented
in black, the variable domain (VH) is represented in white with light gray regions denoting the positions of the
complementarity-determining regions (CDR), the constant domains are illustrated by striped boxes and the hinge region (in IgG) is indicated in dark
gray. Brackets indicate the positions of putative intra-chain disulfide bridges. Cysteine residues involved in putative inter-chain
disulfide bridges connecting the two heavy chains and the heavy chain with the light chain are denoted by H and L, respectively.
The amino acid sequences of the variable domains are shown below the schematic maps. A small arrow marks the position of
the signal sequence cleavage site and the amino acid residues corresponding to CDR1, 2 and 3 are indicated by light gray boxes.
CDR borders were set according to Kabat et al. [29].



at this position is of importance for protein structure and/
or solubility (position 276 in Fig. 2).

Like the previously characterized eutherian and
marsupial IgG, the heavy chains of the platypus IgG isotypes,
IgG1 and IgG2, contain three constant domains (Fig. 1
and 3). However, although the platypus IgG conforms to
the general IgG structure, some important differences
were observed. For instance, in platypus IgG2 the
cysteine residue forming the disulfide bridge that connects the
heavy (H) chain with the light (L) chain is positioned in the
C-terminal region of the CH1 domain (Fig. 1 and 3). In
contrast, in most, if not all, previously characterized
mammalian IgG molecules, this cysteine residue occurs
in the N-terminal part of the CH1 domain (Fig. 3). One
potential inter-chain disulfide bridge-forming cysteine
residue is found in the hinge region of platypus IgG1,
whereas two cysteines are present in the same region of
IgG2 (Fig. 1 and 3). On the whole, the two IgG isotypes
found in platypus display a remarkable divergence in
sequence compared to IgG isotypes of other mammalian
species.

Sequence analysis of the V-regions also revealed some
interesting features. In addition to the canonical cysteine
residues, the V-regions of both the IgG2 and the IgE
clone have two extra cysteine residues located in
the complementarity-determining region (CDR)2 and/or
CDR3 loop structures (Fig. 1). Considering the high
reactivity of thiol groups and the structural flexibility of the
loop structures, additional intra-chain disulfide bridges
probably occur in these V-regions. Moreover, the
additional cysteine residue in the CDR2 of platypus IgG2,
residue 52 according to [29], appears to face the CDR3
[30]. A similar arrangement of cysteine residues in the
heavy chain V-regions of the llama was suggested to be
compatible with a disulfide bridge between CDR2 and
CDR3 [31]. The presence of intra-CDR3 disulfide bridges
has been indicated in the heavy chain V-regions of shark
[32], cow [33] and members of Camelidae [31, 34].
Hence, a disulfide bridge may form between the CDR2
and CDR3 of the IgG2 clone and an intra-CDR3 cysteine
bridge probably occurs in the IgE clone (Fig. 1). A more
detailed analysis of the V-gene repertoire in platypus
revealed that additional cysteine residues that form
putative disulfide bridges, either between the CDR2 and
CDR3 loop structures or within the CDR3, were found in
9 of 25 clones analyzed [35].

Fig. 2. Amino acid sequence alignment of a selected panel of ε-chains. The top sequence is generated by the amino acid
residues that are represented in the majority of the sequences. Residues that match with the consensus sequence are indicated by
a dash. The positions of the exon/intron boundaries (based on sequences in which gene structure has been determined) are
indicated by arrowheads above the sequence. All cysteine residues involved in putative inter-chain and intra-chain disulfide bridge
formations are marked by asterisks and the intra-chain disulfide bridges are indicated by brackets. Cysteine residues involved in
the formation of inter-chain disulfide bridges connecting the heavy chains and the heavy chain with the light chain are denoted
by H and L, respectively.

A phylogenetic tree was generated by comparing the
complete amino acid sequence of the platypus IgE and
IgG constant regions with those of a selection of the
presently available mammalian IgE and IgG sequences,
along with two IgY sequences from birds (Fig. 4). The
platypus IgE and IgG form separate branches, outside of
all the eutherian IgE and IgG. Strikingly, the branch for
the platypus IgG isotypes occurs outside of both
marsupial and eutherian IgG, whereas the platypus IgE
sequence indicates that monotremes and marsupials are
sister lineages.



2.3 Relative expression levels of platypus IgE,
IgG1 and IgG2 in the spleen of a free-living
Tasmanian platypus and Northern blot
analysis

Fig. 3. Amino acid alignment of a selected panel of γ-chain sequences. The top sequence is generated by the amino acid
residues that are represented in the majority of the sequences. Residues that match with the consensus sequence are indicated by
a dash. The positions of the exon/intron boundaries (based on sequences in which gene structure has been determined) are
indicated by arrowheads above the sequence. All cysteine residues involved in putative inter-chain and intra-chain disulfide bridge
formations are marked by asterisks and the intra-chain disulfide bridges are indicated by brackets. Cysteine residues involved in
the formation of inter-chain disulfide bridges connecting the heavy chains and the heavy chain with the light chain are denoted
by H and L, respectively.

The relative expression levels of the various isotypes in
the spleen of a free-living platypus were estimated by
screening the non-amplified spleen cDNA library with
isotype-specific probes. No cross-hybridization to other
isotypes was observed with the isotype-specific probes.
The screenings revealed that IgE, IgG1 and IgG2 are
expressed at relative levels of 10, 60 and 1, respectively.
Hence, IgG1 seems to be the predominant plasma IgG
isotype in the platypus, whereas IgG2 is much less abundant.
IgG1 is expressed at six times the level of IgE and the
expression level of IgG2 is only one tenth of that observed for IgE.
However, it should be emphasized that these values are
based on the relative abundances of mRNA and need
not reflect the serum levels of the various Ig isotypes.
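
The fold differences quoted above follow directly from the relative levels of 10, 60 and 1. As a small illustrative sketch (the function name `fold_relative_to` is hypothetical and the numbers are simply the relative mRNA levels reported in the text), the normalization can be written as:

```python
def fold_relative_to(levels: dict, reference: str) -> dict:
    """Express every level as a multiple of the chosen reference isotype."""
    ref = levels[reference]
    if ref == 0:
        raise ValueError("reference level must be non-zero")
    return {isotype: value / ref for isotype, value in levels.items()}


# Relative mRNA levels reported for the platypus spleen library screening.
relative_levels = {"IgE": 10, "IgG1": 60, "IgG2": 1}

folds = fold_relative_to(relative_levels, "IgE")
total = sum(relative_levels.values())
shares = {isotype: value / total for isotype, value in relative_levels.items()}

for isotype in relative_levels:
    print(f"{isotype}: {folds[isotype]:.1f}x IgE, {shares[isotype]:.1%} of the three isotypes")
```

These are ratios of library clone counts only; as the text stresses, they need not mirror serum protein levels.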

None of the 28 analyzed Ig clones contained exons
corresponding to the membrane region, indicating that the
expression levels of the membrane-bound forms are
comparatively low. A Northern blot analysis using
isotype-specific probes was performed to estimate the
relative abundance of transcripts encoding
membrane-bound versus secreted forms of the platypus IgE, IgG1
and IgG2 isotypes, respectively. However, no transcripts
encoding the membrane-bound forms of the three
isotypes were detected (data not shown). Based on our
results we estimate that in the spleen less than 5% of the
total mRNA pool for each of the respective isotypes
corresponds to the membrane-bound forms, which appears
to be in agreement with the situation in other mammals.

3 Discussion 

We have demonstrated the presence of IgE and two
separate IgG isotypes, IgG1 and IgG2, in a monotreme,
Ornithorhynchus anatinus, the duck-billed platypus.

Sequence comparisons of the platypus isotypes and a
selection of the presently available mammalian ε- and
γ-chain sequences indicate that only minor changes in the
overall structures of IgE and IgG have occurred since the
evolutionary separation of the monotremes from the
other mammalian lineages (Fig. 2 and 3). Apart from
several small insertions and deletions (1-4 full codons) and a
few changes in the positions and numbers of disulfide
bridges and putative glycosylation sites, few changes
other than single point mutations have occurred.
However, due to the relatively high number of point
mutations, the divergence among the various primary
nucleotide and amino acid sequences is substantial (Fig. 2
and 3).

Fig. 4. A comparative analysis of the amino acid sequence of the entire constant region of a selected panel of mammalian IgE
and IgG sequences and two IgY sequences from birds presented in the form of a phylogenetic tree. The bootstrap tree is based
on 1,000 independent comparisons and the bootstrap values are added at each branch point. The sequence distance is
represented by the length of the branches according to the scale at the bottom of the figure, which corresponds to a sequence
divergence of 5%.

The most striking differences among the various ε-chain
sequences in Fig. 2 occur in rodent IgE, which has lost
one of the intra-chain disulfide bridges in the CH1
domain (position 15 and 103) and has a substantially
longer C terminus than all other mammalian IgE. These
characteristic features probably appeared in a common
rodent ancestor after the major radiation of the placental
mammals. Interestingly, the recent finding that this
intra-chain disulfide bridge is absent also in a marsupial
ε-chain, that of the Australian brushtail possum,
suggests that it has been lost at least twice during
mammalian evolution [36].

The mRNA level for IgE was surprisingly high in the
platypus spleen. Only a sixfold difference in mRNA levels was
observed between the predominant serum antibody in
platypus, IgG1, and IgE. This is intriguing, since, in
general, the serum concentration of IgE is significantly lower
than that suggested by the mRNA level. In human adults,
the average serum level of IgG is 9.5-12.5 mg/ml,
whereas IgE is present at concentrations that are 30,000
times lower (10-400 ng/ml). The mRNA levels presented
here are based on studies in a free-living platypus.
Hence, increased expression of IgE as a result of
parasitic infections may account for some elevation in the
mRNA level. Furthermore, in humans, the turnovers of
IgG and IgE differ quite remarkably. Serum IgG has a
half-life of about 3 weeks (23 days), whereas the half-life
of IgE in the circulation has been estimated at 2-2.5 days
[37]. The expression level of IgE is thus much higher than
can be expected on the basis of serum levels alone.
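
The turnover argument can be made concrete with a rough calculation. The sketch below assumes a simple one-compartment steady-state model, in which serum concentration is proportional to synthesis rate multiplied by half-life. This model is an illustrative assumption and is not taken from the paper. The human figures (a 30,000-fold serum difference, half-lives of 23 days and 2-2.5 days) are those quoted above, and `synthesis_ratio` is a hypothetical helper name.

```python
def synthesis_ratio(conc_ratio: float, half_life_a: float, half_life_b: float) -> float:
    """Synthesis-rate ratio A/B implied by a serum concentration ratio A/B.

    Assumes steady state with first-order clearance, where
    concentration ~ synthesis_rate * half_life / ln(2), so that
    rate_A / rate_B = (conc_A / conc_B) * (half_life_B / half_life_A).
    """
    if half_life_a <= 0 or half_life_b <= 0:
        raise ValueError("half-lives must be positive")
    return conc_ratio * half_life_b / half_life_a


IGG_HALF_LIFE_DAYS = 23.0
IGE_HALF_LIFE_DAYS = 2.25          # midpoint of the quoted 2-2.5 day range
SERUM_RATIO_IGG_TO_IGE = 30_000    # human serum IgG relative to IgE

ratio = synthesis_ratio(SERUM_RATIO_IGG_TO_IGE, IGG_HALF_LIFE_DAYS, IGE_HALF_LIFE_DAYS)
print(f"implied IgG/IgE synthesis ratio: about {ratio:,.0f}")
```

Under these assumptions the gap in synthesis rate is roughly ten times smaller than the gap in serum concentration, which is the direction of the correction the text describes. It does not by itself account for the sixfold mRNA difference seen in the platypus.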

Several findings indicate a common phylogenetic history
of the avian and amphibian υ-chains and the mammalian
ε- and γ-chains [5, 6, 9]. In analogy with IgG in mammals,
IgY is the major serum antibody in birds and amphibians,
whereas with regard to structure (the number of exons
and inferred positions of disulfide bridges) IgY is more
similar to mammalian IgE. Comparison studies of the
trans-membrane exons of the υ gene in duck and the
corresponding regions of mammalian γ and ε genes revealed
several shared characteristic features [9]. In similar
studies of Xenopus IgY no particular similarities with either
the duck υ or the mammalian γ and ε trans-membrane
exons were observed. However, conserved residues in
the trans-membrane and cytoplasmic domains that are
absent in other isotypes were identified [6]. The authors
conclude that Xenopus IgY, duck IgY and mammalian
IgG and IgE probably all share a common precursor and
that this IgY-like ancestor arose by a gene duplication of
IgM that occurred before the anurans or even the
amphibians appeared [6].

Although IgG is the predominant serum antibody in mammals,
the role of regulating mast cell degranulation falls mainly
on IgE, an antibody that is present at low concentrations.
The separation of the effector functions of an IgY-like
ancestor into two separate isotypes, IgG and IgE, may
therefore have been of selective advantage.
Furthermore, the conservation of the overall structure of IgE is
strongly indicative of an evolutionary advantage of
maintaining the function of the IgE mast cell system. In some
mammalian species where mast cells are triggered by
both IgE and IgG, as in the guinea pig, a second
immunization with an antigen can result in massive anaphylactic
reactions [38]. In contrast, in species where mast cells
are triggered exclusively by IgE, mast cell degranulation
is normally kept under more stringent control. So far, only
one IgE isotype has been found in all mammalian
species studied, now including the platypus. Hence,
restricting regulation of mast cell degranulation to one isotype
and having only one gene for this isotype may confer a
selective advantage. The potentially very high turnover of
IgE raises questions regarding the role of this isotype in
the inflammatory response. Does IgE function as a
"doorkeeper", scanning the antigen repertoire of the
environment and preparing the individual for potentially
harmful pathogens [39]? A rapid turnover may then more
adequately prepare the individual to deal with its present
environment.

Like IgG in other mammals, both of the IgG heavy chains
found in platypus (γ1 and γ2) have only three constant
domains (Fig. 1 and 3). Hence, the second domain in the
Ig ancestral to platypus IgG1 and IgG2 was apparently
deleted prior to the evolutionary separation of
monotremes from the other mammalian lineages. Alternatively,
two or three independent deletions occurred, one in
each of the three major mammalian lineages (or one in
the common ancestors of marsupials and placental
mammals and one in monotremes). However, the high
overall structural similarity of the platypus IgG to
marsupial and placental IgG strongly favors the first alternative.

In mouse, rat, pig and human the different IgG isotypes 
are relatively closely related within a single species. In 
contrast, the sequence divergence between the two 
platypus IgG isotypes is remarkably high (Fig. 3). In fact, 
the sequence divergence between the two IgG isotypes 
in platypus is actually greater than that between the 
mouse and human IgG isotypes (Fig. 4), indicating that
the two platypus IgG isotypes appeared as separate
isotypes before the major radiation of placental mammals
60-120 million years ago. Although quite specula-
tive, the distances in the phylogenetic tree presented in 
Fig. 4 suggest that the platypus IgG isotypes were 
formed by a gene duplication shortly after the separation 
of monotremes from the common mammalian ancestor, 
possibly as long as 150-170 million years ago. The dif- 
ferent IgG isotypes in rodents, pigs and humans may 
instead have arisen from relatively recent duplications 
within each separate subgroup of eutherians (Fig. 4). 

The high degree of sequence divergence between the
two platypus IgG isotypes is also reflected in several
structural differences. For instance, the position of the
cysteine residue involved in the putative disulfide bridge
connecting the heavy chain with the light chain differs
between the two isotypes. In IgG1, this cysteine residue
is located in the N-terminal region of the CH1 domain,
whereas in IgG2 it is located in the C-terminal region of
the same domain (Fig. 1 B and C). To our knowledge, the
latter position occurs only in the platypus IgG2 and is
thus unique among mammalian IgG molecules. The
N-terminal cysteine residue in the CH1 domain of mouse
IgG1 has also been lost, but it is not replaced by a
cysteine residue in the C-terminal region (Fig. 3). Hence, the
heavy chain of mouse IgG1 seems to rely entirely on
non-covalent interactions with the light chain, a feature
that has been observed in other Ig isotypes as well. For
example, disulfide bridges between the heavy and light
chains are absent in the IgM-like Ig and the two low
molecular weight Ig isotypes of the bullfrog (Rana
catesbeiana) [40].

The two platypus IgG isotypes also differ with respect to
the number of potential N-linked glycosylation sites.
Platypus IgG2 has four potential N-linked glycosylation
sites, whereas IgG1 has only one site located in the CH2
domain (Fig. 3). However, the lone N-linked glycosylation
site in IgG1 is conserved between the two platypus
isotypes and among other Ig isotypes as well, including
opossum IgG (Fig. 3). In fact, this site corresponds to the
site in the ε-chain that is conserved among all IgE
molecules isolated so far (position 276 in Fig. 2).

In summary, the primary structures of the cloned
platypus γ1-, γ2- and ε-chains suggest that all the major
evolutionary changes that gave rise to the Ig isotypes
expressed by present day mammals emerged prior to
the separation of monotremes from marsupials and
placental mammals. Furthermore, the presence of three
constant domains in both of the highly divergent IgG
isotypes found in platypus supports the conclusion that
deletion of the second domain of IgG occurred very early
in mammalian evolution. The results from the
phylogenetic analysis of platypus IgE and IgG are not conclusive
with respect to the evolutionary relationship between
monotremes and marsupials, indicating that further
comparative studies of a large panel of nuclear genes are
required to fully resolve the relationship (Fig. 4).
However, the platypus IgG sequences and several other
unrelated nuclear gene sequences, echidna IgM (Belov et al.,
submitted for publication), echidna IgA (Belov et al.,
submitted for publication), protamine, neurotrophin,
α-lactalbumin, β-globin and mannose-6-phosphate/
insulin-like growth factor II receptor (M6P/IGF2R), all
support the Theria hypothesis [13, 21-24], whereas it is
opposed only by platypus IgE and two dental genes,
amelogenin and dentin matrix protein 1 [41, 42].
Estimates based on protamine and IgM (Belov et al.,
submitted for publication) sequences indicate that the
monotremes separated from the ancestors of marsupials and
placentals about 150-170 million years ago [13],
whereas the separation of early mammalian ancestors
from early reptiles may have occurred as far back as
310-330 million years ago [14]. Provided that the Theria
hypothesis is correct, this leaves us with an evolutionary
window for the appearance of modern post-switch
isotypes of about 150 million years. Unfortunately, no other
living groups of mammals exist today and we cannot
obtain information regarding individual molecules from
the fossil record. However, further clues may be gained
from other molecules that have co-evolved with the
various antibody isotypes. For this purpose, we have
initiated a study of the high affinity receptor for IgE, FcεRI, in
mammals distantly related to placentals. At present,
however, we are only able to speculate based on
available Ig sequences. Hence, the modern post-switch
isotypes may have appeared in the early mammalian
ancestors essentially in their present form as early as 310-330
million years ago, shortly after the split from the early
reptile lineages.



4 Materials and methods 

4.1 Primer design and PCR amplification 

Partial cDNA clones of the platypus ε- and γ-chains were
isolated by PCR using degenerate primers. The design and
sequence of the primers have been described previously
[27]. A 10-µl aliquot of the same double-stranded cDNA that
was used to generate the platypus spleen cDNA library was
used as template in the PCR amplifications. The PCR
amplifications were carried out in 100 µl under the following
conditions: 50 mM KCl, 10 mM Tris-HCl pH 9.0, 0.1% Triton
X-100, 1.5 mM MgCl2, 0.25 mM each of the four nucleotide
triphosphates, approximately 1 µg of each of the two
degenerate oligonucleotide primers and 1 unit of Taq DNA
polymerase (Promega, Madison, WI). The reactions were initiated
with a 2-min incubation at 90°C followed by 45 cycles of
40 s at 94°C, 60 s at 52°C and 90 s at 72°C in an MJ
Research minicycler (MJ Research, Inc., MA). The PCR
products were purified from 1% low-melting agarose gels
and cloned into the EcoRI and BamHI sites of Bluescript
SK(+/-) (Stratagene, La Jolla, CA). Purified ε- and γ-chain
fragments were labeled with 32P by random priming
(Megaprime, Amersham Int., GB) and used as probes to screen the
platypus cDNA library.
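
The cycling conditions above can be written down as a small data structure, which also makes the total programmed time easy to check. This is only a sketch: the step labels (denaturation, annealing, extension) are the conventional reading of the three temperatures and are not stated in the text, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str       # conventional label, assumed here for readability
    temp_c: int     # block temperature in degrees Celsius
    seconds: int    # programmed hold time


# Program as described in the text: one initial incubation, then 45 cycles.
INITIAL = Step("initial incubation", 90, 120)
CYCLE = (
    Step("denaturation", 94, 40),
    Step("annealing", 52, 60),
    Step("extension", 72, 90),
)
N_CYCLES = 45


def total_hold_seconds(initial: Step, cycle: tuple, n_cycles: int) -> int:
    """Sum the programmed hold times; ramping between temperatures is ignored."""
    return initial.seconds + n_cycles * sum(step.seconds for step in cycle)


total = total_hold_seconds(INITIAL, CYCLE, N_CYCLES)
print(f"programmed hold time: {total} s ({total / 60:.1f} min), ramping excluded")
```

The relatively low 52°C hold in each cycle is consistent with the use of degenerate primers, which generally need permissive annealing conditions.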



4.2 Isolation of full-length cDNA for platypus IgE, IgG1
and IgG2 (ε- and γ-chains)

Total cellular RNA from platypus splenocytes was isolated
by the guanidinium thiocyanate method as described
previously [43] and poly(A)+ RNA was purified using oligo
dT-coupled magnetic beads according to the manufacturer's
protocol (PolyATtract mRNA Isolation System II, Promega).
Double-stranded cDNA was synthesized using the
TimeSaver cDNA synthesis kit (Pharmacia-Biotech, Uppsala,
Sweden). The cDNA was ligated into the single EcoRI site of
the λgt10 vector and the recombinant vectors were
packaged into phage particles using an in vitro packaging system
(Stratagene). Approximately 130,000 plaques from this
unamplified cDNA library were spread as a monolayer of the
E. coli C600 Hfl strain with a titer of approximately 17,000
plaques/plate on eight plates. The plaques were transferred
to Hybond N+ filters (Amersham Int.). Five of the filters were
screened with the 32P-labeled platypus ε-chain fragment
and three filters were screened with the partial γ-chain
clone. The filters were washed at high stringency (0.1×SSC,
0.1% SDS). Autoradiography was performed for 24-48 h on
Kodak X-Omat AR film (Eastman Kodak Company,
Rochester, NY). Positive plaques were purified and the inserts were
excised by EcoRI and/or NotI cleavage and subcloned into
Bluescript SK(+/-) (Stratagene) or a pGEM-2 vector with a
modified multicloning site. The obtained cDNA clones were
sequenced from both ends using T7 and T3 or Sp6 primers.
Based on the obtained sequences, new sequence primers
were designed and used to complete the entire insert
sequences. In order to identify additional isotypes, the
library was screened with a full-length γ-chain clone
(IgG1:11). Novel signals (negative for the ε- and γ-chain fragment
probes) were subcloned and sequenced as above. All
nucleotide sequences presented in this study were established by
sequencing both strands of the DNA insert.

4.3 Amino acid sequence alignment 

The amino acid sequences of the constant regions from a 
panel of IgE, IgG and IgY sequences were compared using 
the CLUSTAL W program. This program performs progres- 
sive pair-wise comparisons in order of divergence, initiating 
with the most closely related sequence [44]. The neighbor- 
joining method was used to construct the evolutionary tree 
and to estimate the individual branch lengths. A bootstrap 
tree based on 1,000 independent calculations was con- 
structed using this program (Fig. 4). 
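
To illustrate the tree-building step, the sketch below implements a single pair-selection step of the neighbor-joining method on a small toy distance matrix. It is not the CLUSTAL W implementation and does not use the immunoglobulin data from this study: the taxon names and distances are hypothetical, chosen only so that the arithmetic is easy to follow, and `nj_first_join` is an illustrative name.

```python
from itertools import combinations


def nj_first_join(names, dist):
    """Perform one neighbor-joining step.

    Chooses the pair (i, j) minimizing
        Q(i, j) = (n - 2) * d(i, j) - sum_k d(i, k) - sum_k d(j, k)
    and returns (name_i, name_j, branch_i, branch_j), where the branch
    lengths are the distances from i and j to their new parent node.
    """
    n = len(names)
    if n < 3:
        raise ValueError("neighbor joining needs at least three taxa")
    row_sums = [sum(row) for row in dist]
    best = None
    for i, j in combinations(range(n), 2):
        q = (n - 2) * dist[i][j] - row_sums[i] - row_sums[j]
        if best is None or q < best[0]:
            best = (q, i, j)
    _, i, j = best
    branch_i = 0.5 * dist[i][j] + (row_sums[i] - row_sums[j]) / (2 * (n - 2))
    branch_j = dist[i][j] - branch_i
    return names[i], names[j], branch_i, branch_j


# Hypothetical symmetric distance matrix for five toy taxa.
toy_names = ["a", "b", "c", "d", "e"]
toy_dist = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]

print(nj_first_join(toy_names, toy_dist))
```

A full tree is obtained by replacing the joined pair with its parent node, recomputing the distances, and repeating until three nodes remain. Bootstrap support, as in Fig. 4, comes from repeating the whole procedure on alignments resampled column by column.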

The GenBank accession numbers for the various Ig amino 
acid sequences are listed below. 

IgY: duck (X65219) and chicken (X07174). 

IgG: platypus 1 (AY055781), 2 (AY055782); rat 2a (M28669),
2b (M28671), 2c (X07189); mouse 1 (J00453), 2a (J00470),
2b (J00461), 3 (J00451); hamster (U17166); rabbit (L29172);
sheep 2 (X70983); pig 1 (U03778), 2a (U03779), 2b (U03780),
3 (U03781), 4 (U03782); human 1 (J00228), 2 (J00230), 3
(X03604), 4 (K01316); possum (AF191648) and opossum
(AF035195).

IgE: platypus (AY055780), possum (AF157617), opossum
(AF035194), rat (K02901), mouse (X01857), sheep (M84356),
pig (U96100), horse (U15150), dog (L36872), orangutan
(M15399), human (L00022) and chimpanzee (M15398).

4.4 Northern blot analysis 

Poly(A)+ RNA derived from platypus splenocytes was
isolated as described above. Aliquots of approximately 2 µg of
this RNA were loaded in separate lanes on a 1.0% agarose
gel containing 0.2 M formaldehyde. The fractionated RNA
was blotted onto Hybond-N+ filters using 0.05 M NaOH as
transfer solution. The RNA was fixed to the filter by UV
cross-linking at 150 mJ and hybridized overnight at 65°C
in 6×SSC, 0.5% SDS, 3×Denhardt's solution, 2 mM EDTA
and 100 µg/ml yeast RNA. Random primed 32P fragments
of 450-750 bp derived from IgG1, IgG2 and IgE clones,
respectively, were used as probes. The filters were washed
at high stringency (0.1×SSC, 0.1% SDS) prior to
autoradiography. As an internal control of the amount of RNA loaded
in each lane, one of the filters was rehybridized with a
platypus actin fragment (data not shown).